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(54) Title: HIV ENVELOPE POLYPEPTIDES 

(57) Abstract 

A method for die rational design 
and preparation of vaccines based on HIV 
envelope polypeptides is described. In 
one embodiment, the method for mak- 
ing an HIV gp!20 subunit vaccine for a 
geographic region comprises determining 
neutralizing epitopes in the V2 and/or C4 
domains of gpl20 of HIV as depicted in 
the figure. In a preferred embodiment of 
the method, neutralizing epitopes for the 
V2, V3 and C4 domains of gpl20 are 
determined. Also described are DNA se- 
quences encoding gp!20 from preferred 
vaccine strains of HIV. 
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HIV ENVELOPE POLYPEPTIDES 



FIELD OF THE INVENTION 

This inyention : relates to the rational design and 
preparation of HIV vaccines based on HIV envelope 
polypeptides and the resultant vaccines. This 
invention further relates to improved methods for HIV 
serotyping and immunogens which induce antibodies 
useful in the serotyping methods. 

BACKGROUND OF THE INVENTION 

Acquired immunodeficiency syndrome (AIDS) is 
caused by a retrovirus identified as the human 
immunodeficiency virus (HIV) . There have been intense 
effort to develop a vaccine. These efforts have 
focused on inducing antibodies to the HIV envelope 
protein. Recent efforts have used subunit vaccines 
where an HIV protein, rather than attenuated or killed 
virus, is used as the immunpgen in the vaccine for 
safety reasons. Subunit vaccines generally include 
gpl2 0 , the portion of the HIV envelope protein which is 
on the surface of the virus. 

The HIV envelope protein has been extensively 
described, and the amino acid and RNA sequences 
encoding HIV envelope from a number of HIV strains are 
known (Myers, G. et al. , 1992 . Human Retroviruses and 
AIDS. A compilation arid analysis of nucleic acid and 
amino acid sequences. Los Alamos National Laboratory, 
Los Alamos, New Mexico) . The HIV envelope protein is a 
glycoprotein of about 160 kd (gpl60) which is anchored 
in the membrane bilayer at its carboxyl terminal 
region . The N-tenniha 1 segment , gp!20, protrudes into 
the aqueous environment surrounding the virion and the 
C-temainal segment, gp41 , spans the membrane. Via a 
host-ceil mediated process , gpl60 is cleaved to form 



gpl20 iari<^ As : theire 

is no coy a i ierit a tt a chm^n t :biefcw6^^ 
gpl2 0' ' ■■is- re leased ■ : frcin thejsurfac^^ ■ : ■ and 

irif ected eel is . :: ' 

The gp!2 0 : molec^ a polypept ide core 

of 60,000 daltons which is fextensiv^ 
N- linked glycosylation to iricr ease the apparent 
molecular weight of the molecule to 120,000 daltons. 
The amino acid sequence of gp!20 contains five 
relatively conserved domains interspersed with five . 
hypervariable domains. The positions of the 18 
cysteine residues in the gpl20 primary sequence, and 
the positions of 13 of the approxiiiately 24 N- linked 
glycbsylation sites in the g^l2 0 sequence are common to 
all gpl20 sequences. The hypervariable domains contain 
extensive amino acid substitutions, insertions and 
deletions . Sequence var iat i ons in tties e doma i hs result 
in up to 3 0% overall sequence variability between gpl20 
molecules from the various viral isolates. Despite 
this variation, all gpl20 sequences preserve the 
viruses ability to bind to the viral receptor CD4 and 
to interact with gp41 to induce fusion of the viral and 
host cell- membranes-. 

gpl20 has been the ob j ect of intensive 
investigation as a va cc ine c and idat e f or subunit ; . 
vaccines , as the viral protein; which is mb£t likely to 
be accessible to immune attack . gp!2 0 is considered to 
be a good candidate for a subiiriit vaccine /because 
(i) gpl20 is known to possess the CD4 binding domain by 
which HIV attaches to its target c^ 

inf ect ivity can be neutralized in vitro by antibodies 
to gp 120, (iii) the majority o^ 
neutralizing activity p^ 

infected individuals can be removed with a gpl20 
affinity column, and (iv) the gpl20/gp4i complex 
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appears to be essential for the 'tr-msmLpsXptiJef: HIV by 
cell-to-cell fusion; 

The identif icatibh of epitbp recb^ized by vir^ 
neutralizing ant ibbdiesy i for 1Ae rati 

5 design of vaccines effective against HiV-i infection. 
One way in which antibodies would be expected 
neutralize HIV-1 infection is by blocking the binding 
of the HIV- 1 envelope glycoprotein, gpl20, to its 
cellular receptor, CD4 . However, it has been 

10 surprising that the CD4 blocking activity, readily 
demonstrated in sera from HIV-1 infected individuals 
(31, 44) and animals immunized with recombinant 
envelope glycoproteins (1-3), has not always correlated 
with neutralizing activity (2, 31, 44) .Results 

15 obtained with monoclonal antibodies have shown that 

while some of the monoclonal antibodies that block the 
biriding of gpl20 to CD4 possess neutralizing activity, 
others do not (4, 7, 16, 26, 33, 35, 43, 45) • When the 
neutralizing activity of CD4 blocking monoclonal 

20 antibodies are compared to those directed to the 

principal neutralizing determinant (PND) located in the 
third variable domain (V3 domain) of gpl20 (10, 39), 
the CD4 blocking antibodies appear to be significantly 
less potent. Thus, CD 4 blocking monoclonal antibodies 

25 typically exhibit 50% inhibitory concentration values 
• in the 1^10 /xg/ml range (4, 16, 26, 33, 35, 43, 

45) whereas PND directed monoclonal antibodies 
typically exhibit IC 50 values in the 0 .1 to 1. 0 /ig/ml 
range (23, 33, 42) . 

30 Subunit vaccines, based on gp!20 or another viral 

protein, that can effectively induce antibodies that 
neutralize HIV are still being sought. However, to 
date no vaccine has not been effective in conferring 
protection against HIV infection . 
■ 35 .'; ' ; 
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DESCRIPTION OF THE BACKGROUND ART 

Recombinant subunit vaccines are described in 
: : :Berman ; et al . , PCT /US 9 1/0 2 2 5 0 (publ ished as number 
: ym9T/1522B on 17 October 1991) . See also .>/ : ;« ; ^V : .. : fiU. et 
a (vaccih^ 
envelope recombinant vaccine) ; Arthur etal^y Ji Virol . 
63(12) : 5046-5053 (1989) (purified gp!20) ; and Bennan 
et aii:; Pfoc. Natl. Acad. Sci. USA 85:5200-5204 (1988) 
(recombinant envelope glycoprotein gp!20) . 

10 Numerous sequences for gp!20 are known. The 

sequence of gpl20 from the IIIB substrain of HIV- 1^ 
referred to herein is that determined by Muesing et 
al. , "Nucleic acid structure and expression of the : 
human AIDS/lymphadenopathy retrovirus, Mature 

15 313:450-458 (1985). The sequences of gpl20 from the 
NY-5, Jrcsf , Z6, 2321, and HXB2 strains of HIV-1 are 
listed by Myers et al., ''Human Retroviruses and AIDS ; A 
compilation and analysis of nucleic acid and amino acid 
sequences , " Los Alamos National Laboratory, Los Alamos, 

20 New Mexico (1992) . The sequence of the Thai isolate 
: A244 is provided by McCutchan et al . , "Genetic Variants 
of HlVrl: in Thailand, 11 AIDS Res. and Human Retroviruses 
8:1887-1895 (1992) . The MN 19W clone is described by 
Gurgo et ai . , "Envelope sequences of two new United 

25 States HIV-l isolates," Virol . 164: 531-536 (1988) . 
The amino acid sequence of this MN clone differs by 
approximately 2% from the MN-gpI20 clone (MN^) 
disclosed herein and obtained by Berman et al. 
Each of the above-described references is 

30 incorporated herein by reference in its ehtiirety. 



SUMMARY OF THE INVENTION 

The present invention provides a method for the 
rational design and preparation of vaccines based on 
35 HIV envelope polypeptides. This invention is based on 
the disc very that there are neutralizing epitopes in 
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the V2 and C4 domains of gpl20, in addition to the 
neutralizing epitopes: in the; V3 domain. .. In addition/ 
the ^ouht of variation of the neutral i z ing epitopes is 
: highly constrained/ facilitating the design of an HI^ , 
5 subunit vacc ine that can induce antibodies that 

neutr al iz e vs piurail i ty of HIV strains for a .given ^.y^' 
geographic region.: 

In one embodiment, the present invention provides 
a method for making an HIV gpl20 subunit vaccine for a 

10 geographic region in which a neutralizing epitope in 

the V2 and/or C4 domains of gp!20 of HIV isolates from 
the geographic region is determined and an HIV strain 
having gpl20 which has a neutralizing epitope in the V2 
or C4 domain which is common among isolates in the 

15 geographic region is selected and used to make the 
vaccine. ■ ■ 

in a preferred embodiment of the method,: 
neutralizing epitopes for the V2, V3, and C4 domains of 
gpl20 from HIV isolates from the geographic region are 

20 determined. At least two HIV isolates having different 
neutralizing epitopes in the V2> V3> or C4 domain are 
selected and used to make the HIV gp!20 subunit 
vaccine. Preferably , each of the selected isolates 
have one of the most common neutralizing epitopes for 

25 the V2, V3 # or C4 domains . • 

The invention also provides a multivalent HIV 
gp!20 subunit vaccine. The vaccine comprises gp!20 
from two isolates of HIV having at least one different 
neutralizing epitope . Preferably , the isolates have 

30 the most common neutralizing epitopes in the geographic 
region for one of the domains. 

A DNA sequence of less than 5 kilobases encoding 
-gp-120 from preferred vaccine strains of HIV f GNiE s and 
GNE l6 , expression construct comprising the GNE$-gpi2b and 

35 GNE l6 -gpl20 encoding DNA under the transcriptional and 
trans lational control of a heterolog us promoter, and 
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isolated GNEg^gpi2 6 and GNE I6 ^gpl20 are also provide^; . : 
The invention £ur ther provides improved methods f br HIV 
serotyping in which epitopes in the V2 or C4 domaiins of 
gp!20 are d^ and provides immunogens (truncated 

gpl2 0 sequences ) which induce antibodies useful in the 
serotyping methods. 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 describes inhibition of CD4 binding by 

10 monoclonal antibodies to recombinantly produced gp!20 
from the MN strain of HIV (MN-rgpl20) . Mice were 
immunized with MN-rgp 120 and the resulting splenocytes 
were fused with the NP3X63.Ag8.653 cell line as 
described in Example 1. Thirty-five stable hybridoma 

15 clones, reactive with MN-rgpl20 were identified by 

ELISA. Secondary screening revealed seven cell lines 
(1024;, 1093/ 1096, 1097, 1110 , 1112 , and 1027) 
secreting antibodies able to inhibit the binding of 
MN-rgpl20 to bidtin labeled recombinantly produced CD4 

20 (rsCD4) in a ELISA using HRPO-strepavadin. Data 
obtained with monoclonal antibodies from the same 
fusion (1026, 1092, 1126) that failed to inhibit 
MN-rgpl20 binding to CD4 is shown for purposes of 
• ; comparison. • 

25 FIGU^ 2 shows neutralizing activity of CD4- 

blocking monoclonal antibodies to MN-rgpl20 . 
Monoclonal antibodies that blocked the binding of : 
MN-rgp 120 to CD4 were screened for the capacity to 
inhibit the infection of MT2 cells by the MN strain of 

30 HIV-l in vitro. Cell free virus was added to wells 

containing serially: diluted antibodies and incubated at 
4°C for 1 hr . After incubation, MT-2 cells were added 
: : -..'to the wells and the cultures were then grown for 5 
days at 37°C. Cell viability was then measured by 

35 addition of the color imetric tetrazolium compound MTT 
as described in reference (35) of Example 1. The 
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■ optical densities f each well were measured at 540 ran 
• ■ using a micrdtiter palate ;: reading spectrophotoiiiet "rvr 
Inhibition of virus infectivity was calculated by 
dividing the mean bjitidil densities from wells 
5 containing monoclonal; antibodies by the mean value of 
wells that received viruis alone * Monoclonal antibodies 
that blocked GD4 binding are the same as those 
indicated in Figure Legend 1. Data from the V3- 
directed monoclonal antibody to MN-rgpl20 (1034) is 
10 provided as a positive control. Data obtained with the 
V3 directed monoclonal antibody, 11G5, specific for the 
IIIB strain of HIV-l (33) is shown as a negative 
control . 

FIGURE 3 is a diagram of gpl20 fragments used to 

15 localize the epitopes: recognized by the CD4 blocking 
monoclonal antibodies to MN-rgpl20 • A series; of 
fragments (A) corresponding to the V4 and C4 domains 
(B) ( SEQ . ID. NO. 14) of the gene encoding MN-rgpl20 
were prepared by PGR. The gpl20 gene fragments were 

20 fused to a fragment of the gene encoding Herpes Simplex 
Virus Type 1 glycoprotein D that encoded the signal 
sequence and 25 amino acids from the mature amino 
terminus. The chimeric genes were assembled into a 
mammalian cell expression vector (PRK5) that provided a 

25 GMV promoter, translatipnal stop codons and an SV40 
polyadenylation site V- The embryonic human kidney 
adenocarcinoma cell line, 293s, was transfected with 
the resulting plasmid and recombinant proteins were 
recovered from growth conditioned cell culture medium. 

30 Fragments of MN-rgpl2d, expressed as HSV-1 Gd fusion 
proteins , were produced by transient transf ection of 
293s cells (Example 1) . To verify expression , cells 
were metabolically labeled with [ 35 S ] -methionine , and 
the resulting growth conditioned cell culture 

35 supernatants were immunoprecipitated (C) using a 
monoclonal antibody ; 5B6> sp cific f r thiet amino 
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; terminus of HSV^l Gd and fixed S. aureus. The 
x : immunpprecipitated proteins were: resolved: on 4 to 20 % 
; acryiamide gradient gels using and visualized 

by autoradiography. The samples were : Lane 1, 
5 Fm.^ lane ^ 3 > ■■:.F^;-^9r443;;, 

lane 4, FMN . 414-451; lane 5, MN-rgpl20. The gel 
demonstrated that the proteins were expressed and 
migrated at the expected molecular weights. 

FIGURE 4 shows a C4 domain sequence comparison 
10 (SEQ. ID. Nos. 3-13). The C4 domain amino acid 

sequences of recombinant and virus derived gpl20s used 
for monoclonal antibody binding studies were aligned 
starting the amino terminal cysteine. Amino acid 
positions are designated with respect to the sequence 
15 of MN-rgpl20. Sequences of the LAI substrains , IIIB , 
BH10, Bru, HXB2 , and HXB3 are shown for purposes of 
comparison. 

FIGURE 5 shows sequences of C4 domain mutants of 
MN-rgpl20 (SEQ. ID. Nos. 3 and 15-23) . Nucleotide 

20 substitutions, resulting in the amino acid sequences 
indicated, were introduced into the C4 domain of 
MN-rgpl20 gene using recombinant PGR. The resulting 
variants were assembled into the expression plasmid, 
pRK5, which was then transf ected into 293s cells. The 

25 binding of monoclonal antibodies to the resulting C4 
domain variants was then analyzed (Table 5) by ELISA. 

FIGURE 6 illustrates the reactivity of monoclonal 
antibody 1024 with HIV-1^ substrains. The cell 
surface binding of the C4 domain reactive monoclonal 

30 antibody 1024 v to H9 ceils chronically infected with the 
IIIB, HXB2, HXB3 , and HXB10 substrains of HIV-1 LAI or 
HIV-1MN was analyzed by flow cytometry. Cultures of 
virus infected cells were reacted with either 
monoclonal antibody 1024 , a nonrelevant monoclonal 

35 antibody (control) , or ay broadly cross reactive 
monoclonal antibody (1026) raised against rgpl20 . 
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After washing away unbound monoclonal antib dy, the •.: 
cells were then labeled with /fluorescein conjugated 
goat antibody to mouse IgG (Fab 1 ) 2 , washed and fixed 
with par a forma Idetiyde ^ The jtibsuI t ing cells ; were 
analyzed for degree of fiuorescence intensity using a 
FACSCAN (Beet on Dickenson; Fuilerton > CA) . 
Fluorescence was measured; as mean intensity of the 
cells expressed as mean channel number plotted on a log 
scale. 

FIGURE 7 shows the determination of the binding 
affinity of monoclonal antibodies for MN-rgpl20. CD4 
blocking monoclonal antibodies raised against MN-rgpl20 
(1024 and 1097) -or IIIB-rgpl20 (13H8 and 5C2 ) were • 
labeled with [ I25 I ] and binding titrations using 
MN-rgpl20 (A and B) or IIIB-rgpl20 (C and D) were 
carried out as described in the Example li A> binding 
of monoclonal antibody 1024 ; B binding of monoclonal 
antibody 1097; C, binding of monoclonal antibody 13H8; 
and D binding of monoclonal antibody 5C2. 

FIGURE 8 shows the correlation between gpl20 
binding affinity (K^) and neutralizing activity (IC50) 
of monoclonal antibodies to the C4 domain of MN-rgpl20. 
Binding affinities of monoclonal antibodies to the C4 
domain of gpl20 were determined by Scat chard analysis 
(Figure 9 , Table 5) . The resulting values were plotted 
as a function of the log of their neutralizing^ 
activities (IC^) determined in Figure 2 arid Table 6. 

FIGURE 9 depicts the amino acid sequence of the 
mature envelope glycoprotein ( gp!2 0 ) from the 
clone of the MN strain of HIV-l (SEQ« ID. NO. 1) . 
Hypervariable domains are from 1-29 (signal sequence) , 
131-156, 166-200,305-332, 399^4 13 , and 460-469. The V 
and C regions are indicated (according to Modrow et 
al . f J. Virology 61 (2) : 570 (1987) . Potential 
glycosyiation sites are marked with a (*) . 
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FIGURE 10 depicts the amino acid sequence of a 
f visi n prbtein of ; the resiidues 4 1-511 of the matur 
envelope glycoprotein (gp!20) from the MNg^ clone of 
th^ MN strain of HIV-^lv and ythe gD-1 amino terminus 
5 from the herpes simplex; glycoprotein gD-1 . (SEQ.: ID* 
NO. 2) . The V and € regions are indicated (according 
to Modrow et al., J .Virology €1(2) : 570 (1987). 
Potential glycosylation sites are marked with a (*) . 

10 : DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a method for the 
rational design and preparation of vaccines based on 
* HIV envelope polypeptides. This invention is based on 
the discovery that there are neutralizing epitopes in 

15 the V2 and C4 domains of gpl20, in addition to the 

neutralizing epitopes in the V3 domain. Although the 
amino acid sequences of the neutral iz ing epitopes in 
the V2 , V3, and C4 domains are variable, it has now 
been found that the amount of variation is highly 

20 constrained. The limited amount of variation 

facilitates the design of an HIV subunit vaccine that 
can induce antibodies that neutralize the most common 
HIV strains for a. given geographic region. In 
particular, the amino acid sequence of neutralizing 

25 epitopes in the V2, V3 , and C4 domains for isolates of 
a selected geographic region is determined . gpl20 from 
isolates having the most common neutralizing epitope 
sequences are utilized in the vaccine. 

The invention also provides a multivalent gpl20 

30 subunit vaccine wherein gp!20 present in the vaccine is 
from at least two HIV isolaites which have different 
amino acid sequences for a neutralizing epitope in the * 
V2> V3 f or C4 domain of gpl20. The invention further 
provides improved methods for HIV serotyping in which > 

35 epitopes in the V2 or C4 domains of gp!20 are 
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determined arid provides immunogens which induce 
antibodies usef ul in the s rotting methods, 

The tera ^stibuhit vaccine^: is used herein, as in 
the art> to ref er to a viral vaccine that does not ; 
5 contain Jviriisy but rather contains : one or more viral > 
proteins or fragments of viral proteins .. As used 
herein, the term "multivalent" means that the vaccine 
contains gp!20 from at least two HIV isolates having 
different amino acid sequences for a neutralizing 
10 ' epitope. 

Vaccine Design Method 

The vaccine design method of this invention is 
based on the discovery that there are neutralizing 

15 epitopes in the V2 and C4 domains of gpl20, in addition 
to those found in the principal neutralizing domain 
(PND) in the V3 domain. Selecting an HIV isolate with 
appropriate neutralizing epitopes in the V2 and/or C4 
domains provides a vaccine that is designed to induce 

20 immunity to the HIV isolates present in a selected 
geographic region, in addition, although the amino 
acid sequence of the V2 , V3 , and C4 domains containing 
the neutralizing epitopes is variable , the amount of 
variation is highly constrained , facilitating the 

25 design of a multivalent vaccine Which can neutralize a 
plurality of the most common HIV strains for a given 
geographic region . 

The method for making an HIV gp!20 subunit vaccine 
depends on the use of appropriate strains of HIV for a 

30 selected geographic region . Appropriate strains of HIV 
for the region are selected by determining the 
neutralizing epitopes for HIV isolates and the 
percentage of HIV infections attributable to each 
strain present in the region. HIV strains which have 

35 the most common neutralizing epitopes in the V2 or C4 
domains in the geographic region are selected. , 
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Preferably , isolates that cbnf er protection against the 
most: dbromoh neutrali2iri<g epitopes in the v ;V%^^^ : -:^d C4 
domains for a geographic region are selected. 

•.•••One' embodiment of the method for making: an HIV : 
5 gpi20 subunit vaccine from appropriate strain^ 

for a geographic region comprises the f olibi/inij steps . 
A neutralizing epitope in the V2 or C4 domain of gpl2b 
of HIV isolates from the geographic region is 
determined. An HIV strain having gpl20 with a 

10 neutralizing epitope in the V2 or C4 domain that is 

common among HIV isolates in the geographic region is 
selected. gpl20 from the selected isolate is used to 
make an HIV gpl20 subunit vaccine. 

In another embodiment of the method, the 

15 neutralizing epitopes in the V2, V3, and G4 domains of 
gp!20 from HIV isolates from the geographic region are 
; determined. At least two HIV isolates having different 
neutralizing epitopes in the V2, V3, or C4 domain are 
selected and used to make an HIV gpl20 subunit vaccine. 

20 Preferably, the vaccine contains gpl20 from at least 
the two or three HIV strains having the most common 
neutralizing epitopes for the V2 , V3, or C4 domains . 
More preferably, the vaccine contains gp!2 0 from 
sufficient strains so that at least about 50%, 

25 preferably about 70%, more preferably about 80% or more 
of the neutralizing epitopes for the V2 , V3 , and €4 
domains in the geographic region are included in the 
vaccine. The location of the neutralizing epitopes in 
the V3 region are well known. The location of the 

30 neutralizing epitopes in the V2 and C4 regions are 
described hereinafter. 

Each of the steps of the method are described in 
detail below. ■ 

35 Determining neutralizing epitopes 
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The f ir s t step in designing a Vaccine a 
selected geographic region is to determine the; 
neutralizing epitopes in the gp!20 V2. ahd/or 
doisiih^v:: : ih: a preferred embodiment , neutralizing: 
5 domain (the principal neutralizing 

domain) are also determined. The location of 
neutralizing epitopes in the V3 domain is well known . 
Neutralizing epitopes in the V2 and C4 domains have now 
been found to be located between about residues 163 and 

Id 200 and: between about residues 420 and 440, 

respectively. In addition, the critical residues for 
antibody binding are residues 171, 173, 174, 177, 181, 
183, 187 , and 188 in the V2 domain and residues 429 and 
432 in the C4 domain, as described in detail in the 

15 Examples . 

The neutralizing epitopes for any isolate can be 
determined by sequencing the region of gpl20 containing 
the neutralizing epitope. Alternatively, when 
antibodies specific for the neutralizing epitope, 

20 preferably monoclonal antibodies, are available the 

neutralizing epitope can be determined by serological 
; methods as described hereinafter. A method for 
identification of additional neutralizing epitopes in 
gpl20 is described hereinafter. 

25 acid sequences of 

various isolates and strains of HIV, the most common 
numbering system refers to. the location of amino acids 
within the gpl20 protein using the initiator methionine 
residue as position i. The amino acid numbering 

30 reflects the mature HIV-1 gpl20 amino acid sequence as 
shown by Figures 9 and Fig * 10 [SEQ. ID Nos. 1 and 2] . 
For gpl20 sequences derived from other HIV isolates and 
which include their native HIV N-termihal signal 
sequence, numbering may differ. Although the 

35 nucleotide and amino acid residue numbers : ma^y not be 

applicable in other strains where upstream deletions or 
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insertions change the length of the viral genome : and 

•••.^•''••••':gpi2.b>: the region enc ding the portions of gp 1 2 0 is 
readily identified by reference to the teachings 
hereih. The variable (V) domains and conserved (G) 
5 doma iris of gpl 2 0 are specif ied according to the > 
rioroehclature of Modrow et al . •■Computer-assistiad 
analysis of envelope protein sequences of seven human 
immunodeficiency virus isolates: predictions of 
antigenic epitopes in conserved and variable regions, " 

10 J.Virbl. 61:570-578 (1987) v 

The first step in identifying the neutralizing 
epitopes for any region of gpl20 is to immunize an 
animal with gpl20 to induce anti-gp!20 antibodies. The 
antibodies can be polyclonal or, preferably, 

15 monoclonal . Polyclonal antibodies can be induped by 
administering to the host animal an immunogenic 
composition comprising gp!20. Preparation of 
immuhbgenic compositions of a protein may vary 
depending on the host animal and the protein and is 

20 well known. For example, gpl20 or an antigenic portion 
thereof can be conjugated to an immunogenic substance 
such as KLH or BSA or provided in an adjuvant or the 
like. The induced antibodies can be tested to 
determine whether the composition is specif ic for 

2:5 X,..;^:l2'0:. : '-:'':If a polyclonal antibody composition does not 
provide the desired specificity, the antibodies can be 
fractionated by ion exchange chromatography and 
immunoaf f inity methods using intact gpl20 or various 
fragments of gpl20 to enhance specif iclty by a variety 

30 of conventional methods . For example, the composition 
can be fractionated to reduce binding to other 
substances by contacting the composition with gpl20 
affixed to a solid substrate. Those antibodies which 
bind to the substrate are retained. Fractionation 

35 techniques using antigens affixed to a variety of solid 
substrates such as affinity chromatography materials 
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ihbl^^ se and the like are well 

Mono ariti-gpl2b antibodies can be produced 

by a numbfer of conventional Methods A mouse can : be 
5 in j ected with an immunogenic coinposition containing v 
gpl20 eihd spleen cells obtained. Those spleen cells 
can be fused with a fusion partner to prepare 
hybridomas. Antibodies secreted by the hybridomas can 
be screened to select a hybridoma wherein the 
10 antibodies neutralize HIV infectivity, as described 

hereinafter. Hybridomas that produce antibodies of the 
desired specificity are cultured by standard 
techniques. 

Infected human lymphocytes can be used to prepare : 

15 human hybridomas by a number of techniques such as 

fusion with a murine fusion partner or transformation 
with EBV. In addition, combinatorial libraries of 
human or mouse spleen can be expressed in E. coli to 
produce the antibodies . Kits for preparing 

20 comb ina tor ia 1 libraries are commercially available* 
Hybridoma preparation techniques and culture methods 
are vre 11 known and constitute no part of the present 
invention. Exemplary preparations of monoclonal 
antibodies are described in the Examples^/ 

25 Following pr epar at i oh of anti-gpl20 monoclonal 

antibodies, the antibodies are screened to determine 
those antibodies which are neutralizing antibodies. 
Assays to determine whether a monoclonal antibody 
neutralizes HIV infectivity are well known and are 

30 described in the literature . Briefly, dilutions of 

antibody and HIV stock are combined and incubated for a 
time sufficient for antibody binding to the virus. 
Thereafter, cells that are susceptible to HIV infection 
are combined with the virus /antibody mixture and 

35 cultured. MT-2 cells or H9 ceils are susceptible to 
infection by m st HIV strains that are adapted for 
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growth in ; the laboratory . Activated peripheral blpq& 
mononucieair cells or macrophages can be 

infected with primary isolates ( isolates from a; patient 
specimens ^ not been cultured in t^celi; line^ 

5 or transf ormed cell lines) . Daar et al , Proc. Natl . :f 
Acad. Sci. C75A:V87: 6574-6578 (1990); describe methods" for 
infecting cells with primary isolates. 

After culturihg the cells for about five days, the 
number of viable cells is determined, as by measuring 

10 metabolic conversion of the fbrmazan MTT dye. The 

percentage of inhibition of inf activity is calculated 
to determine those antibodies that neutralize HIV. An 
exemplary preferred procedure for determining HIV 
neutralization is described in the Examples. 

15 Those monoclonal antibodies which neutralize HIV 

are used to map the epitopes to which the antibodies 
bind. To determine the location of a gpl20 
neutralizing epitope, neutralizing antibodies are 
combined with fragments of gpl20 to determine the 

20 fragments to which the antibodies bind. The gpl20 

fragments used to localize the neutralizing epitope^ 
are preferably made by recombinant DNA methods as 
described here inafter and exemplified in the Examples. 
By using a plurality of fragments, each encompassing 

25 different, ovierlap^ portions of gp!20, an amino acid 
sequence encompassing a neutralizing epitope to which a 
neutraliz ing antibody binds can be determined. A 
preferred exeinpiary d^tera of the neutralizing 

epitopes to which a series of neutralizing antibodies 

30 binds is described in detail in the Examples. 

This use o£ overlapping fragments can narrow the 
location of the epitope to a region of about 20 to 40 
residues. To confirm the location of the epitope and 
narrow l^e iocatioh to a region of about 5 to 10 

35 residues , site-directed mutagenicity studies are 

preferably perform d . Such studies can also det rmine 
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the critical residues of neutralizing::,: 

: antibodies> A; preferred exemplary site-directed 

mutagenicity proced^ is described in the Examples. 
To petiqtm site-directed mutagenicity studied, ; 
5 recombinant: tectoiques can be utilized to introduce 
single amino acid ^substitutions at selected sites into 
gpl20 fragments containing the neutralizing epitope . 
Briefly, overlapping portions of the region containing 
the epitope are amplified using primers that 

10 incorporate the desired nucleotide changes. The 

resultant PGR products are annealed and amplified to 
generate the final product. The final product is then 
expressed to produce a mutagen i zed gpl20 fragment. 
Expression of DNA encoding gp!20 or a portion thereof 

15 is described helre inif tisr and exemplified in the 

Examples • • 

In a preferred embodiment described in Example 1, 
the gpl20 fragments are expressed in mammalian cells 
that are capable of expression of gp!20 fragments 

20 having the same glycolsy lation and disulfide bonds as 
native gpl20. The presence of proper glycolsy lation 
and disulfide bonds provides fragments that are more 
likely to preserve the neutralizing epitopes than 
fragments that are expressed in E. coli, for example, 

25 which lack disulfide bonds and giycosylation or are 
chemically synthesized which lack glycolsylation and 
may lack disulfide bonds. 

Those mutagenized gpl20 fragments are then used in 
an immunoassay using gpl 20 as a control to determine 

30 the mutations that impiair or eliminate binding of the 
neutralizing antibodies. Those critical amino acid 
residues form part bf : the neutralizing epitope that can 
only be altered in limited ways without eliminating the 
epitope . Each alteration that preserves the epitope 

35 can be deterained. Such mutagenicity studies 

dem nstrate the variations in the amino acid sequence 
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of the neu tr a 1 i z ing ep it ope that provide equivalent or 
diminished binding rfayVne^ 

eliminate antibody binding. Although the amino acid 
sequence of gp!20 usied Iri the vaccine preferably 16. 
5 identical to that of a selected HIV isolate for the 
given geographic region; alterations in the amino acid 
sequence of neutralizing epitope that are suitable for 
use in a vaccine can be determined by such studies. 
Once a neutralizing epitope is localized to a 
10 region of ten to twenty : amino acids of gp!20, the amino 
acid sequence of corresponding neutralizing epitopes of 
other HIV isolates can be determined by identifying the 
corresponding portion of the gpl20 amino acid sequence 
of the isolate. 

15 Once the neutralizing epitopes for a given region 

of gpl20 are determined^ the amino acid sequence of HIV 
isolates for the geographic; region are determined. The 
complete amino acid sequence for numerous isolates has 
been determined and is available from numerous journal 

20 articles and in databases. In such cases, 

determination of the amino acid sequence of HIV 
isolates for the geographic region involves looking up 
the sequence in an appropriate database or journal 
article. However ; for some isolates , the amino acid 

25 sequence information does not include the sequence of 
the V2 or C4 domains. 

When the amino acid sequence of a region of 
interest for a given isolate is not known, the amino 
acid sequence can be determined by well known methods. 

30 Methods for determining the amino acid sequence of a 
protein or peptide of interest are well known and are 
described in numerous references including Maniatis et 
al. /Molecular :Cldning--A Laboratory Manual, Cold 
Spring Harbor Laboratory ( 1984). In addition, 

35 automated instruments which sequence proteins are 
commercially available. 
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Alternatively , ,:the -nucieptid^ • 
encoding gpl2 0 or a relevant ; port;io^;Oje : ; gpl? p can be 
determined and the amino acid sequence o^ can be 

deduced . Methods for amplifying gpl20-encoding DNA 
5 from HIV isolates to provide sufficient DNA for 

sequencing are well known v In partictiiar , Ou et al 7 
Science 256 : 1165-117 1 ( 1992 ) ; Zhang ^tal. AIDS 
. : 5:675-681 (1991) ; and Wolinsky Science 255 : 1134-1137 
(1992) describe methods for amplifying gpl20 DNA, 
10 Sequencing of the amp 1 i f i ed DNA is we 11 known and is 
described in Maniatis et al. , : Molecular Cloning--A 
Laboratory Manual , Cold Spring Harbor Laboratory 
(1984 ) and Horvath et al. ,. An Automated DNA 
Synthesizer Employing Deoxy nuc leos ide 
15 3 ' -Phosphor amid it es, Methods in Erizymology 154 : 

313-326, (1987) , for example. In ^ddit:ioh/ automated 
V instruments that sequence DNA are c ommer c i al ly 
available. 

20 in a preferred embodiment, isolate is a 

patient isolate which has not been passaged in culture. 
It is known that following passage in T-dells ; HIV 
isolates mutate and isolates best suited for growth 
under cell culture conditions are selected. For 

25 ^ example , cell culture strains of H±V develop the 

ability to form syncytia . Therefore ; preferably the 

: : amino acid sequence of gp!2 0 is determined from a 

patient isolate prior to growth in ^ Generally , 

DNA from the isolate is amplified to provide suf f ic ient 

30 DNA f or sequencing. The deduced amino acid sequence is 
used as the amino acid s equenee of the is 61a te , as 
des cr ibed her e inbe fore. .:• 

To determine the percentage each isolate 
cons t itutes of total HIV that infects individu^ in 

35 the geographic region, standard lepidemiological methods 
: are used . In particular / suf ficiiant ; isolates are 
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seguenced to ensure confidence that the percentage of 
each; isolate in the geographic region has been 
determined. For example, Ichimura et al> AII)S Res . 
, flujn ; ; JR^ 10 : 263-^269 ( 1994) describe an 

5 epidemiological study in Thailand that determined that 
'there-'-ar^ two strains of HIV present in the region . 
HIV strains have only recently been present in Thailand 
and Thailand, therefore has the most homogenous 
population of HIV isolates known to date. The study 

10 : sequenced 23 isolates from various parts of the country 
and determined that only two different amino acid 
sequences were present in the isolates. 

In contrast, HIV has been infecting individuals in 
Africa for the longest period of any geographic region . 

15 In Africa, each of the most common isolates probably 

constitutes about 5% of the population. In such cases, 
more isolates would need to be sequenced to determine 
the percentage each isolate constitutes of the 
population. Population studies for determining the 

20 percentage of various strains of HIV, or other viruses , 
present in a geographic region are well known and are 
described in, for example, Ou et al , Lancet 341 : 1171- 
1174 (1993 ) ; Ou et al , AIDS Res . Hum . Retroviiruses 
;: 8:1471-1472 (1992) ; and McCutchan et al. , AIDS Res, 

25. : Htiih . a Re'troviiruses .8! 18 8 7 ^1 8 9 5 (1992) . , ;.- : ,v 

: In the United States and western Europe, probably 
about two to four different neutralizing epitopes in 
each of thie V2 , V3 , and C4 domains const i tut e 50 to 70% 
of the neutralizing epitopes for ea ch domain in the 

30 geographic region, as described more fully hereinafter . 



Selection method 

Once the amino acid sequence of neutralizing 
epitopes for strains in a region are determined , gpl2 0 
35 from an HIV strain having gp!20 that has an amino acid 
sequence for a neutralizing epitope in the V2 or C4 
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; doirtain which sequence is one of the most cbiimoh in the 
gebcjfjraphib region is selected. One of the most common 
rieutralizing epitope amino acid sequences means that 

: v.' : 7: ^ ' ; V|l^^iB^ai-h -.'has' ah amino acid sequence for at least one 

: 5 ;: neutralizing epitope that is occurs among; the most 
y/frequently for HIV isolates in the geographic region 
; and thus is present as a significant percentage of the 
population. For example, if there are three sequences 
for a neutralizing epitope that constitute 20, 30, and 

10 40 : percent of the sequences for that epitope in the 

region and the remainder of the population is comprised 
by 2 to 4 other sequences, the three sequences are the 
most common. Therefore, in African countries, if each 
of several amino acid sequences constitute about 5% of 

15 the sequences for a neutralizing epitope and the 

remainder of the sequences each constitute less than 1% 
of the population, the isolates that constitute 5% of 
the population are the most common. 

Preferably, isolates having the most common amino 

20 acid sequences for a neutralizing epitope are chosen. 
By the most common is meant that the sequences occur 
most frequently in the geographic region • For example , 
in the United States, the MN isolate has a C4 
neutralizing epitope that comprises at least about 45% 

25 of the population . The GNE 8 isolate has a C4 

neutralizing epitope that comprises at least about 45% 
of the population. Thus either isolate has the most 
common C4 neutralizing epitope in the region; When 
gp!20 from each isolate is combined in a vaccine, 

30 greater than about 90% of the C4 neutralizing epitope 

sequences are present in the vaccine. In addition, the 
amino acid sequences for the V3 neutralizing epitope in 
the MN and GNE 8 isolates are substantially similar and 
comprise about 60% of the population. Therefore, those 

35 strains have the two most common neutralizing epitopes 
for the V3 domain. In the V2 region, the MN isolate 
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amino acid sepences comprises about 10% of thef 
population; and th GNE 8 isolate amino acid sequenc^ 
comprises about : 6 p % of the population. Therefore,; the 
GNEg strain has the most common neut ra 1 i z ing epitope f pr 
5 the r^gi on ; and; the two strains together comprise the : 
two most common neutralizing epitopes for the region. : - 
A multivalent gp!20 subunit vaccine containing the two 
isolates contains amino acid sequences for epitopes 
that constitute about 70% of the V2 domain, about 60% 

10 of the V3 domain, and about 90% of the C4 domain for 
the United States . 

In a preferred embodiment of the method, one or 
more HIV isolates having an amino acid sequence for a 
neutralizing epitope in the V2 and/ or C4 domains that 

15 constitute at least : about 50% of the population for a 
selected geographic region are selected. In a more 
preferred embodiment, isolates having the most common 
neutralizing epitopes in the V3 domain are also 
included in the vaccine . 

20 As is cl6ar v once the most common amino acid 

sequences for the neutralizing epitopes in the V2, V3, 
and G4 domains are known , an isolate having a common 
epitope for each region is preferably selected. That 
is, when only two or three isolates are used for the 

25 vaccine, it is preferable to select the isolate for 

common epitopes in each region, rather than selecting 
an isolate by analysis of a single region. 

In a more preferred embodiment, gp!20 from 
isolates having epitopes that constitute at least 50% 

30 of the population for the geographic region for V2, V3, 
and C4 domains are present in the vaccine. More 
preferably, the isolates have epitopes that constitute 
at least 60% of the population for the geographic 
region for the three domains . Most preferably , 70% or 

35 more are included. 
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In another preferred embodiment, the entire amino 
acid sequence of % ^2 and C4 d mains is determined in 
the selection process • ] : in addition to selecting common 
sequences for the;; neutralizing epitopes, isolates 
5 having unusual polymorphisms elsewhere in the region 
are preferably not used for the vaccine isolates .V 

Vaccine preparation 

gpl20 from the selected HIV isolate (s) is used to 

10 make a subunit vaccine, preferably a multivalent 

subunit vaccine. Preparation of gpl20 for use in a 
vaccine is well known and is described hereinafter. 
With the exception of the use of the selected HIV 
isolate/ the gpi20 subunit- vaccine prepared in the 

15 method does not differ from gp!20 subunit vaccines of 
the prior art. 

As with prior art gpl20 subunit vaccines , gpl2 0 at 
the desired degree of purity and at a sufficient 
concentration to induce antibody formation is mixed 

20 with a physiologically acceptable carrier. A 

physiologically acceptable carrier is nontoxic to a 
recipient at the dosage and concentration employed in 
the vaccine. Generally/ the vaccine is formulated for 
injection, usually intramuscular or subcutaneous 

25 injection. Suitable carriers for injection include 
sterile water, but preferably are physiologic salt 
solutions, such as normal saline or buffered salt 
solutions such as phosphate buffered saline or ringer 1 s 
lactate. The vaccine generally contains an adjuvant. 

30 Useful adjuvants include QS21 which stimulates 
cytotoxic T-cells and alum (aluminum hydroxide 
adjuvant). Formulations with different adjuvants which 
enhance cellular or local immunity can also be used. 
Addition excipients that can be present in the 

35 vaccine include low molecular weight polypeptides (less 
than about 10 residuets) , proteins , amino acids, 
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carbohydrates including glucose or dextrans , chelating 
agents such as EDTA, and other e^ 

The vaccine can also contain other HIV proteins. 
In -part icular , gp41 or the extracellular port ion of 
: 5 ; gp41 can be preseht in the va cc irie Since gp4 1 has a 
conserved amino acid sequence, the gp41 present in the 
vaccine can be from any HIV isolate . gpl60 from an 
isolate used in the vaccine can replace gpl20 in the 
vaccine or be used together with gpl20 from the 
10 isolate. Alternatively, gpl60 from an isolate having a 
different neutralizing epitope than those in the 
vaccine isolates can additionally be present in the 
vaccine. 

Vaccine formulations generally include a total of 

15 about 300 to 600 jig of gp!20, conveniently in about 1.0 
ml of carrier. The amount of gpl20 for any isolate 
present in the vaccine will vary depending on the 
immunogenicity of the gpl20. For example, gpl20 from 
the Thai strains of HIV are much less immunogenic than 

20 gpl20 from the MN strain. If the two strains were to 
: be used in combination, empirical titration of the 
amount of each virus would be performed to determine 
the percent of the gpl20 of each strain in the vaccine. 
For isolates having similar immunogenicity, 

25 approximately equal amounts of each isolate ' s gpl20 
would be present in the vaccine.: For example, in a 
preferred embodiment, the vaccine includes gpl20 from 
the MN, GNEg, and GNE, 6 strains at concentrations of 
about 300 pg per strain in aboiit 1.0 ml of carrier. 

30 Methods of determining the relative amount of an 

immunogenic protein in multivalent vaccines are well 
known and have been used , for example, to determine 
relative proportions of various isolates in multivalent 
polio vaccines. 

35 The vaccines of this invention are administered in 

; : the same manner as prior art HIV gpl2b subunit 
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vacc iriels . Iri part icu 1 ar , the vaccines are generally 
administered 1 , arid at 6,8 or 12 mbriths, 

deperiding briv protocol . Foiiowing the immuriization 
procedure/ ai^ual or bi-annual boosts can be 
3 : administered^ However, during the immunization process 
and thereafter 7 neutralizing antibody levels can be 
assayed arid the protocol adjusted accordingly. 

The vaccine is administered to uninfected 
individuals. In addition, the vaccine can be 

10 administered to seropositive individuals to augment 
immune response to the virus , as with prior art HIV 
vaccines. It is also contemplated that DNA encoding 
the strains of gp!20 for the vaccine can be 
administered in a suitable vehicle for expression iri 

15 the host. In this way, gp!20 can be produced in the 
infected host, eliminating the need for repeated 
immun i z a t i ons . Preparation of gpl20 expression 
vehicles is described hereinafter. 

20 Production of crpl20 

gpl20 in the vaccine can be produced by any 
suitable ineans, as with prior art HIV gpl20 subuhit 
vaccines. Recombinant ly-produced or chemically 
synthesized gp!20 is preferable to gpl20 isolated 

25 directly from HIV for safety reasons . Methods for 

recombinant production of gpl20 are described below. 

DNA Encoding GNE, and GNE 1f OP120 
and the resultant proteins 
3 0 The present invention also provides novel DNA 

sequences encoding gpl2 0 from the GNEg and GNE !6 isolates 
which can be used to express gp!20 and the resultant 
gpl20 proteins . A nucleotide sequence of less than 
about 5 kilobaises (Kb) , pref erably less than about 3 Kb 
35 having the nucleotide sequence illustrated in Tables 1 
arid 2 , respectively , encodes gpl20 from the GNE 8 and 
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GNE W isolates . The sequences of the genes and t:he 
encoded ptbteins are showii below in Tables lr-3 . ih 
particular; Table 1 illustrated the nucleotide sequence 
(SEQ. ID. NO; 27) and the predict ed ; amino acid sequence 
5 (SEQ. ID; kOv 28) of tti£ GNE 8 isolate of HIV. The upper 
sequence is the coding strand. The table also . 
illustrates the location of each of the restriction 
• sites • ■-.*■■■■ 
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Table 2 illustrates the ri^ aiid 
the predicted amino acid: secpaehce of the GNE^ isolate 
of HIV. The upper s equence is; - the coding strand . The 
table also i^ of the 

5 restriction sites . The fir s t i our pages of the table 
are from one clone of the gene and the second three 
pages of the table are from another clone of the gene. 
The sequences of the clone? differ by about 2%. (The 
nucleotide sequences are SEQ. ID. NOs. 28 and 29, 
10 respectively. The amino acid sequences are SEQ. ID. 
NOs. 30 and 31, respectively .) It is noted that each 
of the sequences includes a stop codon. A gene 
sequence that encodes full length gpl20 can be made by 
repairing one of the sequences. 
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Table 3 illustrates the amino acid sequences for 
the MNy GNEg, and GNE 16 gpl2 0 proteins . The region? of 
the sei^uehces having identical amino acid sequences ire 
enclosed in boxes. . 

■ • ' TABLE 3 - ', 
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: Nucleic acid sequences encoding gpl20: from GNII 8 and 
•:V GN£ 16 capable of .expressing gpl20 can prepared by ; ' ■ : 

»■ conventional means . The nucleotide sequence can be 
5 synthesized. Alternatively, another HIV nucleic acid 
sequence encoding gp 12 0 can be used as a backbone and 
altered at any differing residues by site directed 
mutagenesis as described in detail in Example 1. 

In a preferred embodiment, the nucleotide sequence 
10 is present in an expression construct containing DNA 
encoding gpl20 under the transcriptional and 
translatipnal control of a promoter for expression of 
the encoded protein. The promoter can be a eukaryotic 
promoter for expression in a mammalian cell. In cases 
15 where one wishes to expand the promoter or produce 
gpl20 in a prokaryotic host, the promoter can be a 
prokaryotic promoter. Usually a strong promoter is 
employed to provide high level transcription and 
expression. 

20 The expression construct can be part of a vector- 

capable of stable extrachromosomal maintenance in an 
appropriate cellular host or may be integrated into ; ^ 
host genomes. Normally, markers are provided with the 
expression construct which allow for selection of a V 

25 host containing the construct . The marker can be on 
the same or a different DNA molecule, desirably, the 
same DNA molecule. 

The expression construct can be joined to a 
replication systeia recognized by the intended host 

30 cell. Various replication systems include viral ; 

replication systems such as retroviruses, simian virus, 
bovine papilloma virus : , or the like. In addition, the 
construct may be joined to an amplif iable gene, e.g. 
DHFR gene, so that multiple copies of the gpl20 DNA can 

35 be made. Introduction of the construct into the host 
will vary depending on the construct and can be 
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achieved by any convenient means. ; A wide variety of 

' \ ; pr karyotic and eukaryotic hosts Jean be employed for •/ 
expression of the proteins. : 

Preferably; the gpl20 is expressed in mammalian 
5 cells that provide the .same glycosylation and disulfide 
bonds as in native gpl20i Expression of gpl20 and 
fragments of gpl20 in mammalian cells as fusion 
proteins incorporating N-terminal sequences of Herpes 
Simplex Virus Type 1 (HSV-l) glycoprotein D (gD-1) is 

10 described in Lasky , L A. et al . , 1986 (Neutralization 
of the AIDS retrovirus by antibodies to a recombinant 
envelope glycoprotein) Science 233: 209-212 and Haffar, 
O.K. et al,, 1991 (The cytoplasmic tail of HIV-1 gpl60 
contains regions that associate with cellular 

15 membranes.) Virol. 180:439-441, respectively. A 

preferred method for expressing gpl20 is described in 
Example 3. In the example, a heterologous signal 
sequence was used for convenient expression of the 
protein. However, the protein can also be expressed 

20 using the native signal sequence. 

An isolated, purified GNE 8 -gpl20 and GNE 16 -gpl20 
having the amino acid sequence illustrated in Tables 
1-3 can be produced by conventional methods. For 
example, the proteins can be chemically synthesized. 

25 In a preferred embodiment, the proteins are expressed 
in mammalian cells using ah expression construct of 
this invention. The expressed proteins can be purified 
by conventional means. A preferred purification 
procedure is described in Example 3. 

'■: 30 

qp!20 Fragments 
The present invention also provides gpl20 
fragments that are suitable for use in inducing 
antibodies for use in serotyping or in a vaccine 
35 formulation. A truncated gpl-20 sequence as used herein 
is a fragment of gpl20 that is fr e fr m a portion of 
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the intact gpl2:b sequence b ginning at either the amino 
or ; car boxy terminus of gpl20. A trnincated gpl20 
sequence of this invention is free from the C5 domain ; 
The C5 domain of -gpl20 is a ma j or iiimuhogenic site of 
5 : ■ the molecule. However, ant ibod ies t o the r eg ion do not 
; neutralize: virus . Therefore, elimination of this 
portion of gpl20 from immuhogens used to induce 
antibodies for serotyping is advantageous. 

In another embodiment , the truricated gpi20 
10 sequence is additionally free from the carboxy terminus 
region through about amino acid residue 4 53 of the 
gpl20 V5 domain. The portion of the V5 domain 
remaining in the sequence provides a convenient 
restriction site for preparation of expression 
15 constructs . However, a truncated gpl20 sequence that 
is free from the entire gpl20 ; V5 domain is also 
suitable for use in inducing antibodies. 

in addition , portions of the carboxy terminus of 
gpl20 can also be eliminated from the truncated gpl20 
20 sequence. The truncated gpl2 0 sequence can 

additionally be free from the gp!20 sigrnal sequence. 
The truncated gp!20 sequence can be free from the 
carboxy terminus through amino acid residue 111 of the 
gpljp CI domain, eliminating most of the CI domain but 
25 preserving a convenient r es t r i ct ion site . However , the 
portion of the CI domain through the cysteine residue 
that foinns a disulfide bond can : addi t i ona 1 ly be 
removed, so that the truncated gp!20 sequence is free 
from the carboxy terminus through amino acid residue 
30 117 of the gpl2 0 CI doma in . Alternatively, the 

truncated gpl20 sequence can be free from the amino 
terminus of gp!20 through r es i due 111 of the CI domain, 
preserving the V2 disulf ide bond. : In a preferred 
embodiment, the truncated gpl20 secjuence is free from 
35 the amino terminus of gpl20 through residue 111 of the 
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CI domain and residue 453 through the carb ky terminus 
o f (gp l 20 . 

••• The truncated gpl20 sequences can be produced by 
recombinant engineering, as described previously.; 
Cony eni en t ly , DNA encoding the truncated gpl2 0 sequence 
is joined to a heterologous DNA sequence encoding a 
signal sequence . 

Serotyping method 
10 The present invention also provides an improved 

serotyping method for HIV strains. The method 
comprises determining the serotypes of the V2, V3, and 
C4 domains of gp!20. 

HIV isolates can be serotyped by conventional 
15 immunoassay methods employing antibodies to the : 

neutralizing epitopes in the V2 , V3, and C4 domains for 
various strains of HIV. Preparation of the antibodies 
is described hereinbefore. The antibody affinity 
required for serotyping HIV using a particular 
20 immunoassay method does not differ from that required 
to detect other polypeptide analytes. The antibody 
composition can be polyclonal or monoclonal > preferably 
monoclonal. 

A number of different types of ixmnunoassays are 
25 we? 11 known using a variety of protocols and labels. 
The assay conditions and reagents may be any of a ; 
variety found in the prior art. The assay may be 
heterogeneous or homogeneous. Conveniently / ah HIV 
isolate is adsorbed to a solid phase and detected with 
30 antibody specific for one strain of neutralizing 
epitope for each neutralizing epitope in the y2 # 
and C4 domain . Alternatively f supernatant or lysate 
front the cultured isolate which contains gpl20 can be 
adsorbed to the solid phase. The virus or gp!20 can be 
35 adsorbed by many well known non-specific binding 
methods. Alternatively , an anti-gpl20 antibody, 
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. preferably directed to the carboxy terminus of gp 120. 
can be used to affix ^120 to the solid phase . A gplio 
capture antibody and sandwich ELI SA assay' f or gpl20 
neutralizing epitopes is described by Moore, AIDS Res. 
5 Hum. Retroviruses [^^^^2:1:9; (1993) . Binding between ^/v ; vv : J. - 

the antibodies aM be determined in a number 

of ways. Complex formation can be determined by use of 
soluble antibodies specific for the anti-gpl20 
antibody. The soluble antibodies can be labeled 

10 directly or can be detected using labeled second 

antibodies specific for the species of the soluble 
antibodies. Various labels include radionuclides , 
enzymes, fluorescers, colloidal metals, or the like. 
Conveniently, the anti-gpl20 antibodies will be labeled 

15 directly, conveniently with an enzyme. 

Alternatively, other methods for determining the 
neutralizing epitopes can be used. For example, 
fluorescent-labeled antibodies for a neutralizing 
epitope can be combined with cells infected by the 

20 strain of HIV to be serotyped and analyzed by 
fluorescence activated cell sorting. 

The serotype of the HIV isolate includes the 
strain of the neutralizing epitopes for the V2, V3, and 
C4 domains. 

25 .. ■■/■v^.; : • : v :: .^ ;:: ^■ ; v■. . 

It is understood that the application of the 
teachings of the present invention to a specific 
problem or situation will be within the capabilities of 
one having ordinary skill in the art in light of the 

30 teachings contained herein . . Examples of the products 
of the present invent ion and representative processes 
for their isolation/: use, and manufacture appear below, - 
but should not be construed to limit the invention. 
All literature citations herein are expressly * 

35 incorporated by reference. 
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y- EXAMPLE 1 
Id ntification of C4 Neu&alizing Epitopes 
The following reagents and meth were used in 
the studies described herein. 

5 ••• \'' Y" v ' v : . • ■ ' • "\ 'v§-;A :■ ; 

gpl2 0 sequences and: nomenclature* Amino acid 
residues are designated using the standard single 
letter code. The location of amino acids within the 
gpl20 protein is specified using the initiator 

10 methionine residue as position 1. The designation LAI 
is used to describe the virus isolate from which the 
HIV-l^, HlV-lnro, HIV-1 BRU/ HIV-l^/ HIV-l^ and 
HIV- Ihxbio substrains (molecular clones) of HIV- 1 were 
obtained. The sequence of gpl20 from IIIB substrain of 

15 HIV-1^, is that determined by Muesing et al. (30) . 

The sequence of gp!20 from MN strain of HIV-1 is 
given with reference to the MNgpl20 clone (MNqnb) . The 
sequence of this clone differs by approximately 2% from 
that of the MNi 9g4 clone described by Gurgo et al. (13 ) . 

20 The sequences of gpl20 from the NY-5, JRcsf , 26, 2321, 
and HXB2 strains of HIV-l are those listed by Myers et 
al . (32) except where noted otherwise. The sequence of 
the Thai isolate A2 4 4 is that provided by McCutchan et 
al. (24) ... The variable (V) domains and conserved (C) 

25 domains of gpl20 are specif ied according to the 
nomenclature of Modrow etal. (28) . 

Monoclonal antibody production and screening 
assays. Hybridomas producing monoclonal antibodies to 

30 MN-rgpl20 ( recombinant ly produced gpl20 from the MN 

strain of HIV) (3) were prepared and screened for CD4 
blocking activity as described previously ( 7 , 33) . The 
binding of; monoclonal antibodies to MN-rgpl20 and to 
rgpl20s from the IIIB, NY-5, 26, Z321, JRcsf, and A244 

35 strains of HIV-1 was assessed by enzyme linked 
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immunoadsor bant assays (ELISA) as desci^it>ed previously 
• ( 3 3) • . 

Virus binding and neutralization assays. The 

5 ability of monoclpnal antibodies to neutralize HTV^l 

infectivity in vitro was assessed in a colorimetric MT- 
2 cell cytotoxicity assay similar to that described 
previously (35). MT-2 cells and H9 / HTLV- I 1 1 mn cells 
were obtained through the AIDS Research and Reference 

10 Reagent Program, Division of AIDS , NIAip, NIH : 

contributed by Drs. Douglas Richman and Robert Gallo, 
respectively. Briefly, serial dilutions of antibody or 
serum were prepared in 50 fil volumes of complete and 
then 50 pi of a prediluted HIV-1 stock was added to 

15 each well. After incubation for 1 hr at 4*C, 100 y.1 of 
a 4 x io 5 MT-2 cell/ml suspension was added. After 
incubation of the plates for 5 days at 37 °C in 5% C0 2 , 
viable cells were measured using metabolic conversion 
of the formazan MTT dye. Each well received 20 jil of a 

20 5 mg/ml MTT solution in PBS. 

After a 4 hr incubation at 37 °C, the dye 
precipitate was dissolved by removing 100 /il of the 
cell supernatant, adding 130 pi" of 10% Triton X-100 in 
acid isopropanol, then pipeting until the precipitate 

25 was dissolved. The optical density of the wells was 
determined at 540 ran. The percentage inhibition was 
calculated using the formula: 

.'. 1-/ virus control-experimental V 
(virus control -medium control) 

30 ; . • : - . : ' ; 

Cell surf ace staining of HIV-1 infected cells with 
Monoclonal antibodies . H9 cells (2 x 10 s ) chronically 
infected with the IIIB, HXB2, HXB3 , and HX10 substrains 
of or with HIV- Imn were incubated f or 30 min at 

35 room temperature with monoclonal antibodies (10 /xg per 
ml) in 100 pi of RPMI 1640 cell culture media 
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containing 1% FCS. Ceills were washed arid then 
incubated with 20 per ml of f luor es cein^cqn j uiga ted , 
aff inity goat antibody to mpxise I^G (Fab • ) 2 

(Cappel, West Chester > PA ) for 30 mini ^Mis'-were 
washed, fixed with 1% paraformaldehyde and the bbtind ; 
antibody was quantitated by flow cytometry using a 
F ACS CAN (Becton-Dickenson , Fullerton, CA) , 

Fluorescence data was expressed as percentage of 
fluorescent cells compared to the fluorescence obtained 
with the second antibody alone. Fluorescence was 
measured as the mean intensity of the cells expressed 
as mean channel number plotted on a log scale. 

: Fragmentation of the MN-rgpl20 gene. Fragments of 

the MN-rgpl20 gene were generated using the polymerase 
chain reaction (PCR) (17). Briefly/ forward 30-mer 
oligonucleotide DNA primers incorporating a Xho 1 site, 
and reverse 3 6-mer oligonucleotide DNA primers 
containing a stop codon followed by a Xba 1 site were 
synthesized and used for the polymerase chain 
reactions. Thirty cycles of the PCR reaction were 
performed using 0.3 jig of a plasmid containing the gene 
for gp!20 from the MN strain of HIV-1 (pRKMN. D533) and 
0.04 nM of a designated primers. The PCR reaction 
buffer consisted of 0.1 M Tris buffer (pH 8 . 4 ) ., 50 mM 
KC1, 0.2 mM 4dNTP (Pharmacia, Piscataway, NJ) ., 0.15 M 
MgCl 2 and 0.5 Unit of Taq Polymerase (Perkin-Elmer 
cetus, Norwalk, CN) and a typical PCR cycle consisted 
of a 60 second denaturation step at 94 °C, followed by a 
45 second annealing step at 55° C, and then an 
extension step at 72° C for 45 seconds. , 

Following the PCR ampl if icat ion , the PCR products 
Were purified by phenol and chloroform extraction, and 
then ethanol precipitated . The purif ied products were 
then digested with the restriction endonucleases Xhol 

••: and Xba 1. The resulting PCR products were gel purified 
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: using i% agarose ( SEAKEM, FMC Bioproducts , Rockland , 
ME) r 5% ;; pblya gel electrophoresis (PAGE) and 

then isolated; by electroelution. 

5 8 ite directed mut agenes is of the MN-rgpl20 C4 

domain. A recombinant PGR technique (15) was utilized 
to introduce single amino acid substitutions at 
selected sites into a 600 bp Bgl II fragment of 
MN-rgpl20 that contained the C4 domain. This method 

10 entailed the PGR amplification of overlapping regions 
of the G4 domain of gpl20 using primers that 
incorporated the desired nucleotide changes. The 
resultant PCR products were then annealed and PCR 
amplified to generate the final product. For these 

15 reactions 18-mer "outside" primers encoding the wild 

type sequence (Bgl II sites) were amplified with 3 isomer 
"inside" primers that contained the alanine or glutamic 
acid residue changes. The first PCR reaction included 
IX of the Vent polymerase buffer (New England Biolabs, 

20 Beverly , MA) ,0.2 mM of 4dNTP (Pharmacia, Piscataway, 
N.J. ) , 0. 04 nM of each synthetic oligonucleotide, 0.3 
j*g of linearized plasmid, pRKMN.D533, which contained 
the MN-rgpl2 0 gene . Thirty PCR cycles were performed 
consisting of the following sequence of steps.: ; ;45; : '; : ::; ; v :'; : ;;' : : . 

25 seconds of denaturatipn at 94*C> 45 second of annealing 
at 55 °C arid 45 seconds of extension at 7 2 ° C . Following 
PCR amplification, the product pairs were gel purified 
using a 1% solution of low melt agarose (SeaPlaque, FMC 
Bioproducts, Rockland, ME) . 

30 The agarose containing PGR product was melted at 

65 0 C and combined with the PCR product of the \- ; . 
overlapping pair and equilibrated to 37°C. Added to r 
this (2a;^ pi of l OX Vent Polymerase buffer/ ' 

10 fil of 2 mM 4dNTP, 0. 04 riM each of the "outside" wild 

35 type 18 mer oligonucleotides, 57 jil of H 2 0 and 1 unit of 
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Vent Polymerase. Thirty PGR cycles were performed as 
.. . previously above. '. 

The result!^ 

digested with th^ Bgl I i endohucl ease . The digested 
5 PCR product was then Iigated into the ma^ 

expression vector pRKMN. D533> Which had been digested 
with Bgl II allowing for the removal of a 600 bp 
fragment . Colonies containing the correct insertion 
were identified and Sequenase 2.0 supercoil sequencing 
10 was employed to check for fidelity and the 
incorporation of the desired mutation. 

Expression of gpl2 0 fragments in mammalian cells. 

Fragments of the MN and IIIB gpl20 were expressed in 

15 mammalian cells as fusion proteins incorporating 

N-terminal sequences of Herpes Simplex Virus Type 1 
(HSV-1) glycoprotein D (gD-i) as described previously 
(14> 22). Briefly, isolated DNA fragments generated by 
the PCR reaction were Iigated into a plasmid (pRK»gD-l) 

20 designed to fuse the gp!20 fragments, in frame, to the 
5 • sequences of the glycojprqtein D (gD) gene of Type 1 
Herpes Simplex Virus (gD-1) and the 3 • end to 
translational stop codons . The fragment of the gD-1 
gene encoded the signal sequence and 2 5 amino acids of 

25 the mature form of HSV-1 protein. To allow for 

' expression in mammalian cells, chimeric genes fragments 
were cloned into the pRK5 expression plasmid (8) that 
contained a polyliriker with cloning sites and 
translational stop codons located between a 

30 cytomegalovirus promotor and a simian virus 40 virus 
polyadenylation site . 

The resulting plasmids were trans feet ed into the 
293s embryonic human kidney cell line (12) using a 
calcium phosphate technique (11) . Growth conditioned 

35 cell culture media was collected 48 hr after 

transf ection/ and the soluble proteins were detected by 
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ELISA or by specif ic iradioi^uriopr cipitation Where 
metabdlically labeled pr teins from cell cultiire ; 
supernatant s were resolved ; by sodium dodecyl sulfate 
poly aery laiaide gel; electrc^hbtesis (PAGE) and 
5 visualized by autoradiography as described previously 
(!, 18). 

Rad io immunopr ec ip it at ion of MN-rgpl20 mutants. 

Plasmids directing the expression of the MN-rgpl2 0 C4 

10 domain mutants were trans feet ed into 293s cells as 
described above. Twenty four hours following the 
transfection, the cells were metabolically labeled with 
[ 35 S ] -labeled methionine or cysteine as described 
previously (1). The labeled cell culture supernatants 

15 were then harvested and 0.5 ml aliquots were reacted 

with 1-5 fig of the monoclonal antibody or with 2 p 1 of 
the polyclonal rabbit antisera to MN-rgpl20 and 
immunoprecipitated with Pahsorbin (CalBiochem, La 
Jolla, CA) as described previously (1). The resulting 

20 Pansorbin complex was pelleted by centr if ugation, 

washed twice with a solution containing PBS f 1% NP-40 
and 0.05% SDS and then boiled in a PAGE sample buffer 
containing 1% 2-mercaptoethanol. The processed samples 
were the analyzed by SDS-PAGE and visualized by 

25 autoradiography (1, 18). 

Assays to measure the binding of monoclonal 
antibodies to mutagen! 2 ed MN-rgp 12 0 polypeptides . An 

ELISA was developed to screen for reactivity of 
30 MN-*rgpl20 fragments and mutant proteins with various 
monoclonal antibodies. In this assay, 96 well 
microtiter dishes (Maxisorp, Nunc, Roskilde , Denmark) 
were coated overnight with mouse monoclonal antibody 
( 5B6 ) to gD-1 , at a concentration of 2 . 0 (iq /ml in 
35 phosphate buffered saline (PBS) . The plates were 

blocked in a PBS solution containing 0 . 5% bovin serum 
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; albumin /(-PBS A), and then incubated witih growth 

conditioned cell: cuitur medium f rom tranisf ected cells 
^sxpresising the recombinant gp!20 variants for 2 hr at 
; ;: ; ; room temperature . The plates werie washed three times 
5 ^in PBS containing 0 . 05% Tween 2 0 and then incubated 
with the purified, HRPO-conjugated monbclorial 
antibodies. Following a 1 hr incubation/ the plates 
were washed three times and developed with the 
colorimetric substrate, o-phenylenediamine (Sigma, St. 
10 Louis, MQ) . 

The optical densities in each well were then read 
in a microtiter plate reading spectrophotometer at 
492 run. Each cell culture supernatant containing 
fragments or mutated rgpl20s was normalized for 
15 expression based on the titering of its reactivity to 
the V3 binding monoclonal antibody 1034 or to a rabbit 
polyclonal antisera to MN-rgp 120. Data from these 
experiments were expressed as a ratio of the optical 
: densities obtained with the CD4 blocking monoclonal 
20 antibodies binding to the fragments or MN-rgpl20 
. , mutants compared with the full length wild type 



To normalize for different concentrations of 
MN-rgpl20-derived protein in the cell culture 

25 supernatants , the binding of the CD4 blocking 

iabnoclonal antibodies to each preparation was compared 
to that of an HRPO-con jugated monoclonal antibody to 
the V3 domain of MN-rgpl20 (1034) . Data from these 
experiments were expressed as a ratio of the optical 

30 densities obtained with the CD4 blocking monoclonal 
antibodies to the HRPO con jugated V3 reactive 
. monoclonal antibody. 



CD4 binding assays. The ability of monoclonal 
35 antibodies to inhibit the binding of MN-rgpi20 to 

\ ^-\i':- io&c' mbinant soluble: CD4 : (rsCD4) was deterinined in a 
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: solid phase radioimmunoassay similar to • that : descri^ 

pr-ey ious ly ( 3 3 ) . The ef f ect of single amino ^ci4 ;- ; 
v substitutions on the binding of MN-rgp 1 2 0 mut ant s y t o 
CD4 was deteintoined in a do-immiinopr 
5 ; similar to that described previously (21) ♦ Briefly; 

293 ciells were metabolically labeled with 35 S-methiohine 
24 hr after transf ection with plasmids expressing 
MN-rgpl20 variants. Growth conditioned cell culture 
medium (0.5 ml) was then incubated with 5.0 /xg of 

10 recombinant sCD4 for 90 minutes at room temperature. 
Fbl lowing this incubation, 5.0 /ig of an anti-CD4y 
monoclonal antibody (465) , known to bind to an epitope 
remote from the gpl20 binding site, was added and 
allowed to incubate another 90 minutes at room 

15 temperature. 

The gpl20-CD4-antibbdy complexes were precipitated 
with Pahsorbiri that had been washed with PBS, 
preabsorbed with 0.1% bovine serum albumin and then 
bound with 50 fig of an affinity purified rabbit ahti- 

20 mouse IgG (Cappel, West Chester, PA) . The pellet was 
washed twice with PBS 1% NP-40, 0.05% SDS, and then 
boiled in beta mercaptoethanol containing SDS-PAGE 
sample buffer . The immunoprecipitation products were 
resolved by SDS PAGE and visualized by autoraMography 

25 as described previously (1, 21) . 

Antibody affinity measurements / Anti-gpl20 
antibodies were iodinated with Na 125 I with iodogen* 
(Piercie, Rockford, XL). Briefly, 50 fig of antibody in 

30 PBS was placed in 1.5 ml polypropylene microcentrifuge 
tubes coated with 50 /xg of Iodogen. Two millicuries of 
carrier free Na[ ,25 I] was added. After 15 min., free ,25 I 
was separated from the labeled protein by 
chromatography on a PD-10 column (Pierce, Rockford, IL) 

35 pre-equilibrated in PBS containing 0.5% gelatin. 
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Antib dy conGentratipns following ipdinat ion were 
^ determined 

For binding ^ass ay s; 9 6 -we 1 1 . micro titer plates were 
coated with 100 jiiywell of a 10 Mg/ml solution of 
5 MN-r gp 12 0 or liifirgpi20: in o . 1 M bicarbonate buffer, 
y pH 9 . 6 and ihcubat^ 2 hr at room temperature or 
overnight at 4°C. To prevent non-specific binding, 
plates were blocked f or 1-2 hr at room temperature with 
200 pi /well of a gelatin solution consisting of PBS 

10 containing 0,. 5% (wt/vol) gelatin and 0 - 02% sodium 

azide. Unlabeled anti-gpl20 monoclonal antibody (0 to 
400 nM) was titrated (in duplicate) in situ and 
radiolabeled antibody was added to each well at a 
concentration of 0.5 nM. 

15 After a 1-2 hr incubation at room temperature , the 

plate was washed 1 Ox with the PBS/0.5% gelatin/0. 02% 
i azide buffer to remove free antibody. The antibody- 
gpl20 complexes were splubilized with 0.1 N NaOH/ 0.1% 
SDS solution and counted in a gamma counter. The data 

20 were analyzed by the method of Scatchard (40) using the 
Ligand analytical software program (31) . Kj values 
reported represent the means of four independent 
determinations. 



25 RESULTS 

Character ization of monoclonal antibodies to MN-r gp 120 
that block CD4 binding . Monoclonal antibodies prepared 
from mice immunized: with MN-rgpl20 (3, 33) , were 
screened for the ability to bind to MN-rgpl20 coated 

30 microt iter dishes by ELI SA as described previously 

(33) . Of the thirty five clones obtained, seven were 
identified (1024, 1093:, 1096 , 1097, 1110, 1112, and 
1127) that were able to inhibit the binding of 
MN-rgpl20 to recombinant CD4 in ELISA (Figure 1) or 

35 solid phase or cell surf ace radioimmunoassays (21, 33). 
Previous studiies hav shown that two distinct classes 
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of CD 4 blocking monoclonal antibodies occur : those that 
bihd to Conformation dependent (discontinuous) Epitopes 
(16, 26> 33/ 35, 45) and those that bind 1 6 
: conf ormation independent (sequential) epitopes ^^^7!$;,: 
5 21, 33 7 43) ; 

. To distinguish between these two alternatives, the 
binding of the monoclonal antibodies to denatured 
(reduced and carboxymethylated) MN-rgpl2Q (RCM-gpl20) 
was measured by ELISA as described previously (33). As 
10 illustrated in Table 4, below, it was found that all of 
the GD4 blocking monoclonal antibodies reacted with the 
chemically denatured protein; indicating that they all 
recognized conformation independent (sequential) 
epitopes. 

15- 
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Table 4 '-.■■■"v. 

Proper ties : of m noc 1 hal aritib di^s to MN-rgplS 0 



CD i HIV-1 mn .. C4 rgl20 

5 Itihi- Neutral- HIV-1 mn CM- Domain cross 

MAb bltora ization V3 232120 peptides reactivity 

1024 + + ' - + • 2 : 

1093 + + - + ' - 2 

10 1096 + + - + • 2 

y.:- 1097 + + - ' + '. '2 

1110 + .+..:..■" - + - . • • 2 

1112 + + . - + - 2 

1127 + + - + - . ? 

15 1026 - .+ + + " 1,2,3,4,6 

1092 - - " - + - 1,2,3,4,5 

1126 - - - + • 1,2,3,5,7 

1086 - - - + - 2 

13HB + . - + 1,3 1,2,3,4,5,6,7 

rgpl20 cross reactivity: 1, IIIB-rgl20; 2, MN-rgpl20, 



3, NYS-rgpl20; 4, JrCSF-rgpl20; 5, Z6-rgpl20; 6, 2321- 
rgpl20; 7, A244-rgpl20 

25 : . 

C4 domain peptides: 

1 , FINMWQEVGKAMYAPPIS (SEQ. ID. NO. 24); 

2, MWQEVGKAMYAP (SEQ. ID. NO. 25) ; : 
3 , GKAMYAPPIKGQIR (SEQ . ID. NO. 26) : 

;; : .3p :i . : : ■ ' . . . .,v. : v • . - ■ , • : , 

The cross reactivity of these monoclonal 
antibodies was assessed by ELISA as described 
previously (33). In these experiments, the ability of 
the monoclonal antibodies to bind to a panel of seven 

35 rgpl20s, prepared from the IIIB, MN> Z6, Z321, NY-5, 
A244, and JRcsf isolates of HIV*l> was measiired by 
ELISA (33) . It was found that all of the CD4 blocking 
monoclonal antibodies were strain specif ic and bound 
only to gplZO from the MN strain of HIV-1 (Table 4) . 

40 However, other antibodies from the same fusion 
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(1026 f 1092, and 1126) exhibited much broader cross 
reactivity (Tabl 4; Figure 2) , as : .;dM;^':-<i4 bl eking 
mpnqclohal ahtibbd^ to IIIB-rgpl2 0 fl3H8) described 
[previously (33) *• 
5 Further studies were performed to characterize the 

neutralizing activity of the antibodies to MN-rgpl20. 
In these studies, monoclonal antibodies were incubated 
with cell free virus (HIV-l^) , and the resulting 
mixture was then used to infect MT-2 cells in 

10 microtlter plates. After 5 days, this plates were 

developed by addition of the colorimetric dye, MTT, and 
cell viability was measured spectr ophot ometr i ca 1 ly • It 
was found (Table 4, Figure 2) that all of the CD4 
blocking monoclonal antibodies were able to inhibit 

15 viral infectivity. However the potency of the 

monoclonal antibodies varied considerably with some 
monoclonal antibodies (eg. 1024) able to inhibit 
infection at very low concentrations (IC50 of 0.08 ^g 
per ml) whereas other monoclonal antibodies (eg. 1112) 

20 required much higher concentrations (IC M of 30 /ig per 

': : ::- : ;x- : ;: mi : ) .. In control experiments two monoclonal antibodies 
: to MN-rgpl2 0 from the same f u s i on ( eg . 1086,1 09 2 ) were 
ineffective, whereas the 1026 monoclonal antibody 
eixhibited potent neutralizing activity. Similarly, 

25 ^nonoclonal antibodies to the V3 domain of IIiB-rgpl20 

(I0F6, 11G5) known to neutralize the infectivity HIV-l^ 
(33) , were unable to neutralize the HIV-l,^ virus . 

Binding studies using synthetic peptides were then 
performed to further localize the epitopes recognized 

30 by these monoclonal antibodies as described previously 
(33) . When a peptide corresponding to the V3 domain 
(3) of MN-rgpl20 was tested, it was found that none of 
tihe CD4 blocking antibodies showed any reactivity. 
However the epitope recognized by the non-CD4 blocking 

35 monoclonal antibody, 1026, prepared against MN-rgpl20 
could be localized to the V3 domain by virtue of its 
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binding to thiis peptide. In Qtixer experdjients , = three 
^ synthetic peptides fr m the C4 dpmain of gpi? 0 that 
incorporated sequences recognizetd by the CD4 blocking, 
weakly neutralizing monoclonal antibodies described by 
5 ' ' ^Keating et al . (26) were tested (Table 4) . It was 
/found that nbh the CD4 . blocking monoclonal 
antibodies to MN-rgpl20 reacted with these peptides, 
however the non-neutralizing, CD4 blocking 13H8 
monoclonal antibody bound to the peptides corresponding 

10 to residues 423-440 of IIIB-gpl20 and residues 431-441 
of MN-gpl20 , but not to that corresponding to residues 
426-437 of IIIB-gpl20. Thus the 13H8 monoclonal 
antibody recognized a epitope that was similar, if not 
identical, to that described by McKeating et al • (26) . 

15 This result is consistent with the observation that the 
13H8 monoclonal antibody and the monoclbhal antibodies 
described by Cordell et al. (4) and McKeating et al. 
(26) exhibited considerable cross reactivity, whereas 
the antibodies to MN-rgpl20 were highly strain 

20 specific. 



CD4 blocking antibodies recognize epitopes in the 
C4 domain. Previously, a strain specific, CD4 blocking 
mpnoclonal antibody (5C2) raised against : IIIB-rgpl20 

•.2§V:' "i-'-'wa'S: found to recognize an epitope in thie C4 domain of 
IIIB-rgpl20 (21, 33) . Although the 5C2 monoclonal 
antibody was able to block the binding of rgpl20 to 

; GD4 , it was unable to neutr a 1 iz e HIV-1 inf ect i vity in 

vitro (1) . Affinity columns prepared from 5 C2 adsorbed 

30 an 11 amino acid peptide (residues 422 to 432) from a 
tryptic digest of gpl20 (21) , however monoclonal 
• antibody 5C2 was unable to recognize this peptide 

boated onto wells of microtiter dishes in an ELISA 
; format (Nakamura et al. , unpublished results) . 

35 To determine whether the CD4 blocking monoclonal 

antibodies raised against MN-rgpl20 recognized the 
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corresponding epitope in the C4 domain of MNr-rgpl2 0 ; a 
series of over lapp 

dbmairis of HIV-l,^ gp!2 0 , were prepared for expression 
in mammalian cells . A diagram of the fragments 
5 e^ress^d is shown in Figures 3A and 3EU The ; e4^^ d 
fragments were expressed as fusion proteins that 
incorporated the signal sequence and amino terminal 25 
amino acids of HSV-1 glycoprotein D as described above. 
Plasmids directing the expression of the chimeric 

10 C4: domain fragments were trans fected into 293 ceils, 
and their expression was monitored by 
radioimmunoprecipitation studies where a monoclonal 
antibody, 5B6, specific for the mature amino terminus 
of glycoprotein D was utilized. It was found 

15 (Figure 3B) that all of the fragments were expressed 
and exhibited mobilities on SDS-PAGE gels appropriate 
for their size. Thus fMN. 368-408 (lane 1) exhibited a 
mobility of 19 kD; fMN. 368-451 (lane 2) exhibited a 
mobility of 29 kD; fMN. 419-433 (lane 3) exhibited a 

20 mobility of 6 kD, and fMN. 414-451 (lane 4) exhibited a 
mobility of 6.1 kD. 

: -'::'; ; The : binding of monoclonal antibody 1024 to -the 
recombinant fragments was then determined by ELISA (as 
described in Example 1) . ; It was found ( Figure 3 A) that 

25 monoclonal antibody 1024 reacted with thei fragments 

that contained the entire C4 domain of MN-rgpl20 (fMN 36g . 
4ii > f MN40MJ5) ' hxxt failed to bind to a fragment derived 
from the adjacent V4 domain (fMN^Mos) or to anbther 
fragment that contained V4 domain sequences and the 

30 amino terminal half of the C4 domain (fMNj^^);. rrhe 

fact that 1024 bound to the fMN 4Ml and fMN^ fragments 
demonstrated that the epitopes recognized by all of 
these monoclonal antibodies were contained entirely 
between residues 419 and 443 in the C4 domain. 
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Residues rec gnized by m n clonal antib di s that 
block bihdli^ ^ to CD4 . To Identify \^^;^^r^^^^^^^^^^:\ 

■ v specif ic amind^ a residues that might be part bf the ; 

epitopes recognized by these monoclonal antibodies, the 
5 sequence of ; th^ Cf domain of MN-xgpl20 was compared to 
those of the gp!20s from the six other rgpl20s that 
failed to react with the CD 4 blocking monoclonal 
antibodies (Figure 4). It was noted that the sequence 
of MN-rgpI20 was unique in that K occurred at position 

10 429 whereas the other rgpl20s possessed either E,G, or 
R at this position. Another difference was noted at 
position 440 where E replaced K or S. To evaluate the 
significance of these substitutions , a series of point 
mutations were introduced into the MN-rgpl20 gene 

15 (Figure 5). Plasmids expressing the mutant proteins 
were transf ected into 293s cells, and expression was 
verified by radioimmunoprecipitation with a monoclonal 
antibody (1034) directed to the V3 domain of 
MN-rgpl20. Cell culture supernatant s were harvested 

20 and used for the monoclonal antibody binding studies 
shown in Table 6. To verify expression, radio- 
immunoprecipitation studies using cell culture 
supernatants f rom cells metabolically labeled with 
[ 35 ] S-methibnine were perf ormed using the 1024 

25 monoclonal ant ibody specif ic for the C4 domain of 

MN-rgpl20 (A) or the 1034 monoclonal antibody specific 
for the V3 domain of MN-rgpl20. Immune complexes were 
precipitated with the use of fixed S. aureus and the 
adsorbed proteins were resolved by SDS-PAGE. Proteins 

30 were visualized by autoradiography. The samples were: 
Lane 1, \-m : ;4i : -9Ai;r:i : iah* 2 MN.421A; lane 3 MN.429E; lane 
4 , MN.429A; lane; 5> MN.432A; lane 6, MN.440A; lane y7.,^..^..\^ 
MN-rgpl2p. Thet immuhopr ec ip it at ion study showed that 
1024 antibody binds well to all the variants except 3 

35 and 4 which are mutated at residue 429. 1034 antibody 
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was used as a control and precipitates with anti^V3 
antibodies • 

The effect of these mutations on tte of 
the CD4 blocking monoc lona 1 ant ibodies was then 
; ; 5\ evaluated by ELISA as illustrated in Table 5, below. 

Table 5 " 

Binding of GD4 blocking monoclonal 
antibodies to C4 domain mutants 

10 



Proteins/ 
MAbs 


1024 


1093 


1096 


1097 


1110 


1112 


1127 


5G2 


MN-rgpl20 


1.0 


1.6 


1.0 


1.0 


1.0 


i.o 


1.0 


0. 05 


MN-419A 


i.ii 


1.10 


0.94 


1.21 


0.78 


0.95 


1.10 


ND 


Mk-421A 


l.ii 


1.60 


0.88 


1.42 


1.34 


0.91 


1.10 


ND 


MN-429E 


0.03 


0.07 


0.11 


0.04 


o.io 


0.10 


0.02 


ND 


MN-429A 


0.10 


0.07 


0.14 


0.04 


0.09 


0.11 


0.05 


ND 


MN-432A 


0.77 


0.15 


0.59 


0.08 


0.12 . 


0.24 


0.26 


ND 


MN-440A 


1.06 


1.13 


1.08 


0.87 


1.12 


1.0 


1.3 


ND 


IIIB-rgpl20 


0.03 


. ND 


ND 


ND 


Nb 


ND 


ND 


1.0 


MN-423F 


ND 


ND 


ND 


ND 


ND 


ND 


ND 


0.45 


MN-423F/429E 


ND 


ND 


ND 


ND 


ND 


ND 


ND 


1.09 



25 Data represent the relative binding of MAbs to the native and 
mutant forms of rgpl20. Values were calculated by dividing the 
binding (determined by ELISA) of the CD4 blocking MAbs to the 
proteins indicated by the values obtained for the binding of a V3 
specific MAb (1034) to the same proteins (as-described in 

30 Example 1 ) . " . . 



It was found that replacement of l^with an A 
residue (MN. 44 OA) had no effect oh the binding of the 
1024 monoclonal antibody or any of the other CD4 

35 blocking monoclonal antibodies (Table 5) . The 

significance of K at position 429 was then Evaluated by 
:. :. substitution of either A (MN.429A) or E (MN.429E) at 
this location. It was found that the A f or K 
substitution at position 429 (MN. 420A) markedly reduced 

40 the binding of the 1024 monoclonal antibody and all of 
the other CD4 blocking monoclonal antibodies (Table 5) . 
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V Similarly, ^M-replac merit of E for k (MN.429E) at this 
position totally a gat d the binding of taie 1024^^^^^-;^ 

; monoclbna 1 antibody and a 1 1 of the other CD4 blocking 

monoclonal antibodies (Table 5) * Several other mutants 
5 were constructed to evaluate the role of positively 

charged residues in the C4 domain. , It was found that A 
for K substitutions at positions 419 (MN. 4 19 A) and 
421(MN.421A) failed to interfere with the binding of 
any of the CD4 blocking monoclonal antibodies as 

10 illustrated in Table ;6, below. 

Table 6 

Correlation Between Antibody Binding Affinity 
and. Virus Neutralizing Activity 



15 



20 



MAb 


Block 


K.. nM° 




102 4 e 




:': 2.7 ± 0.9 


0.4 


1086 c ' f 




9.7 ± 2.2 




1093 e 




9.9 ± 2.6 


3 .3 


109 6 e 


+ ' 


10 ± 6 


12 


1097° 


+ . 


13.4 ± 3.7 


12 


1110* 




12.1 ± 1.7 


12 


1112 e 


■ .+/• ■• 


20 ± 4.4 


200 


112 7 e 




9.3 ±4 


3.3 


1086^ 




9.7 ±2.2 




13H8 f * 




22 ± 6 





nM* 1 



25 



4 Blocked binding of rgpl20 MN to CD4. 

b Blocked binding of rgpl20 Illb, hot rgp!20 MN, to 

30 CD4. • 

c Mean of four determinations calculated using the 
method of Scatchard ( 40) . 

a Neutralization of HIV-l^ inf ectivity xh vitro. 

e Anti-rgpl20 MN antibody. 

35 f Did not neutralize Hiv-1 infectivity. 

* Anti-rgpl2 b Illb antibody . 
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However, when K at position 432 was replaced with 
A (MN432 . A) , the binding of all of the GD4 blocking 
antibodies was markedly reduced (Table 5) . : 
Interestingly, the binding of monoclonal antibody 1024 
5 appeared less affected by this substitution than the 
other monoclonal antibodies (Table 5) . Thus, these 
studies demonstrated that K 429 and K 432 were critical f or 
the binding of all of the CD4 blocking monoclonal 
antibodies, and that K^, K 421 , and K^o did not appear to 
10 play a role in monoclonal antibody binding. 

Amino acids recognized monoclonal antibodies that 
block binding of IIIB-rgpi2 0 to CD4 . The 

identification of residues 429 and 432 as being part of 

15 the epitope r ecogni 2 ed by the MN-r gpl2 0 specific CD4 
blocking monoclonal antibodies was particularly 
interesting since th^ previously found to 

be implicated in the binding of the 5C2 monoclonal 
antibody (21). The properties; of the 1024 like- 

20 monoclonal antibodies and the 5C2 monoclonal antibody 
differed from the C4 reactive monoclonal antibodies 
described by other investigators (4 , 43) in that the 
former appeared strain specific and the latter were 
broadly cross reactive. To account for the strain 

25 specificity of these morioclorial antibodies, the 
sequence of the eleven amino acid peptide of 
IIIB-rgpl20 recognized by monoclonal antibody 5C2 was 
compared to the corresponding sequence of MN-rgpl20. 
It was found that the IIIB protein differed from the 

30 MNB protein at positions 4 29 where K replaced E and at 
position 423 where I replaced F (Figure 5) . Because it 
was known from previous studies (33) that the 5C2 
monociorial antibody was unable to bind to gp!20 from 
two strains (i.e. , ^5 and : JRcsf ) that also possessed 

35 E at position 423, it selemed unlikely that this 

position could account for the strain specificity of 
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5C2. Sequence comparison (Figure 5 ) also showed that 
:; "'V • gpl20 from HIV-1^ was uh ique in that a phenylalanine 

; : residue occurred at position 423 whereas the other six 

^ strains examined possess an I at t^is position . 
5 To determine Whether residues 423 and/ or 429 could 

account for the type specificity bi the 5C2 monoclonal 
antibody, a mutant of MN-rgpl20 was constructed which 
incorporated an F for I replacement at position 423 
(MN.423F). In addition, the MN-rgpl20 mutant, MN.429E 

10 (described above) was further mutagenized to 

incorporate a F for I substitution at position 423 
(MN.423F) , thus resulting in a double mutant 
(MN.423F, 429E) whose sequence was identical to that of 
IIIB-rgpl20 within the 10 amino acid 5 C2 epitope 

15 (Figure 4). The expression of these mutants in 293s 
cells was verified by radio immunopr ec ip it at ion using 
rabbit polyclonal antisera to MN-rgpl20. When the 
binding of the 13H8 monoclonal antibody to a set of 
mutants incorporating substitutions at position 423 and 

20 429 was examined, it was found that none of the 
; replacements effected the binding of this antibody 

(data not shown) . When the 5C2 monoclonal antibody was 
examined, it was found that the F for I replacement 
(MN.423 F) conferred partial reactivity (Table 5) . 

25 When the double mutant (MN . 423 F , 4 29E) , containing the F 
for I substitution as well as the E for K substitution 
was tested, binding that was indistinguishable from 
that to TITB-rgpl2 0 was observed (Table 5) . These 
results demonstrated that F at position 423 and E at 

30 position 429 both play a role in binding of the 5C2 
monoclonal antibody, and suggest that the strain 
specificity of 5C2 can be attributed to the residues at 
these positions. 

Examination of the sequences of gp!20 from the 
various clones of LAI that have been analyzed revealed 
" that several substrains of LAI differed from each other 
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in the C4 domain. Thus the sequences of the IIIB (30) , 
Bru (46) , and HXB3 (6) clones: o^ 
j-^-V- .-positions 423 and 429 where F airid : E res idues occurred 
respectively . However , * the sequeilce of the HXB2 
5 substrain (36) differed from the others at these 

positions where, like MN-rgpl20, k replaced E and at 
position 423 where I replaced F (Figure 5). Similarly, 
the HX10 and BH10 substrains (36 , 37) differed only at 
position 423 where, like HIV-l^, I replaced F. Based 

10 on the mutagenesis experiments above, it would be 

predicted that monoclonal antibody 1024 should be able 
to bind to gpl20 from the HXB2 substrain of LAI, but 
not the HXB3 substrain. If I 42 3 was important for 
binding, then 1024 should also bind the HX10 substrain . 

15 To test this hypothesis , the binding of monoclonal 

antibody 1024 to the surface cells infected with either 

■: IIIB, HXB2, HXB3 , and HX10 substrains of HIV- 1^ was 
measured by flow cytometry. It was found that 
monoclonal antibody 1024 was able to bind only HXB2 

20 providing further confirmation that residues 423 arid 
429 were important for the binding bf-this antibody. 
The fact that monoclonal antibody 1024 did not bind to 
HXld infected cells suggested that I 42 3 was not important 
for the binding of this monoclonal Antibody. Thus 

25 these studies demonstrate that react ivity^ the 1024 

monoclonal antibody segregates with the occurrence of F 
and E residues at positions 423 and 429, respectively, 
and shows that substrains of HIV^l^ -dif f rom one 
another at a functionally significant epitope in the C4 

30 domain . . 



Neutralizing activity of CD4 blocking antibodies 
correlates with their binding iff inityv To account for 
the difference in virus neutral izirigr activity between 
35 the CD4 blocking monoclonal antibodies, their gpi20 
binding affinities were determined by competitive 
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binding of -labeled mohoclbnal to rgpi20 

( Table 6 ) . Typica 1 Sea t char d analy s is of data firpm 
these aissays is shown in Figure 7 (A to Cj . Linear', 
one-site binding kinet i cs were obs erved for all the 
5 monoclone to MN-r gp 12 Q , suggest ing that 

only a single class: of sites was recognized/ m 
there was no cooper at ivity between two combining sites 
of eaich immunoglobulin molecule. It was found 
(Figure 7A, Table 6) that monoclonal antibody 1024 , 

10 which exhibited the most potent virus neutralizing 

activity (IC 50 of 0.08 fig per ml) , possessed the lowest 
K d (2 .7 nM) . In contrast (Figure 7C, Table 6) , 
monoclonal antibody 1112/ the antibody that exhibited 
the weakest virus neutralizing activity (IC^ of 30 fig 

15 per ml) possessed the highest K d (20 nM) . Y^s for six 
additional CD4 -blocking monoclonal antibodies raised 
against MN-rgpl2 0 were also determined (Table 6) . It 
was found that monoclonal antibodies that possessed 
intermediate K d s similarly possessed intermediate 

20 neutralization IC 50 values . To explorie the relationship 
between virus neutr ali z ing activity and gpl20 binding 
affinity, the data in Table 6 was plotted in several 
different ways. It was found that when the K,, of the 
monoclonal antibodies was plotted as a function of the 

25 log of the IC^, a linear relationship was obtained 
(Figure 8) v Using this analysis a correlation 
coefficient (r) of 0.97) was obtained . Thus f this 
graph demonstrates that the virus neirtralizing activity 
of these monoclonal antibodies is directly pr opor t i ona 1 

30 to the gpl20 binding affinity/ and that the threshold 
for neutralization at this epitope is def ined by the 
: slope of the graph in Figure 8. 

A similar analysis was performed with the non- 
neutralizing CD4 blocking monoclonal antibodies to 

35 IIIB-rgpl20| 5C2 and 13H8 . The binding curve for 13H8 
(Figure 7C) showed that it bound to a sing 1 class of 
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sites on IIIB-rgpl20^ nM. The affinity 

of 5C2 could not be : de*e^ this assay b^caus^ at 

antibody conce^ 5 hM, non-- linear : 

(reduced gpl20 binding) was observed . This effect was 
5 suggestive steric hindrance at these concentrations or. 
negative cooperativity : between combining sites . This 
binding affinity was also determined for the non- 
neutralizing, non-CD4 blocking monoclonal antibody to 
MN-rgpl20, 1086. The fact that this antibody exhibited 
10 a binding affinity similar (9.7 nM) to many of the 
neutralizing monoclonal antibodies but failed to 
inhibit infectivity, proves that high antibody binding 
affinity alone is not sufficient for neutralization. 

15 Effect of C4 Domain Mutants on CD4 binding. 

Finally, the CD4 binding properties of the series of 
MN-rgpl20 mutants, constructed to localize the C4 
domain epitopes, were measured in a qualitative co- 
immunoprecipitation assay. In these studies the 

20 ability of the mutagenized WN-rgpl20 variants to co- 
immunoprecipitate CD4 was evaluated as described 
previously (21) in a qualitative co-immunoprecipitatibn 
assay similar to that described previously (19) . 
Briefly, 293 cells, trans fee ted with plasmids directing 

25 the expression of MN-rgpl2 6 variants described in 
Figure 5, were metabolically labeled with 
[ 35 S] -methionine , and the growth conditioned cell 
culture supernatants were incubated with rsCD4 . The 
resulting rsCD4 :gp!20 cdmplexes were then 

30 immunopr ec ip i t ated by addition of the CD4 specif ic 
monoclonal : antibody , 465 (A) or a positive control 
monoclonal antibody (1034) directed to the V3 domain of 
MN-rgpl20 (B) . The immunoprecipitated proteins were 
resolved by SDS-PAGE and visualized by autoradiography 

35 as described previously (3). The samples were: Lane 
1, MN.419A; lane 2, 1W. 421A; lan 3 , MN.429E; lane 4 , 
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"• MN. 429A; lane 5, MN.432A; lane 6/ ^.44 OA; lane 7, 
MN-rgpl20. The gel showed tha that b^ock 

; V antibody binding do hot block binding bf CD4. 

Therefore , the ant ibod ies do not bind to the gpl20 CD4- 
: '. 5 binding contact residues. This i^ steric 
hinderance may inhibit antibody binding, rather than 
that the antibodies bind directly to the CD4 contact 
residues to inhibit binding. 

It was found that all of the variants in which 
10 apolar A residue was substituted for the charged K or E 
residues (e.g. , MN.419A, MN.421A, MN.432A, and MN.440A) 
were still able to co-immunoprecipitate rsCD4. 
Similarly, the replacement of E for K at position 429 
(MN.429E), the replacement of F for I at position 423 
15 (MN.423F) or the mutant which incorporated both 

mutation (MN.423F,429E) also showed no reduction in 
their ability to co-immunoprecipitate rsCD4 . Thus , 
radical amino acid substitutions at five positions 
failed to affect the binding of gpl20 to CD4. These 
20 results were consistent with previous studies (5, 21, 
: . 34) where it was found that only a few of the many 
mutations that have been induced in this region 
effected CD4 binding. 

This study indicates that neutralizing epitopes in 
25 the C4 domain have now been found to be located between 
about residues 420 and 440 . In addition, the critical 
residues for antibody binding aire residues 429 and 432. 

EXAMPLE 2 

30 Identification of V2 Neutralizing Epitopes 

The procedures described in Example 1 were used to 
map epitopes in the V2 region of gpl2 0 . Table 7 
illustrates the results of mutagenicity studies to map 
V2 neutralizing epitopes . In the table , the columns 

35 indicate the comparison of binding of the monoclonal 
antibodies with wild type (WT) gp!20 in comparis n to 
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Various mutations of gpl20 using standard notation. 
For example > HGITIR" indicates that th glycine (G) at 
residue 171 has been replaced by an arginine (R) # 
"172A/173A" indicates that the residues at 172 and 173 
5 have been replaced by alanine. The neutralizing , 

monoclonal antibodies tested (MAbs) are listed in the 
rows . The numerical values in the table are the 
optical density Value, of an ELISA assay perf ormed as 
described in Example 1 to measure the amount of 
10 antibody binding. The underlined values indicate 
significantly reduced binding, indicating the 
substituted residue is critical for binding of the 
antibody. 



15 TABLE 7 . 

WT G171R, 172A/ E187V 187V/188S 

: M174V 173A 

■ MAbs . 

6E10 1.00 0.10 1.28 0.60 0.25 

1017 1.00;: Q. 70 1,10 0.87 0;04 

20 1022 1. 00 0.80 1.10 1.00 0.00 

1028 1 . 00 0 ; 9 0 1.18 1 . 07 0 .04 

1029 1.00 0;83 1.16 1,01 0.16 
1019 1.00 0.13 1.30 0.75 0.74 
1027 1.00 0 . 00 1.20 0.80 0. 64 

25 1025 1.00 0.69 0.00 0.00 0.83 

1088 1. 00 0 .73 1.12 0. 94 0. 03 

13H8 l.Ob 0.77 0.78 0.48 0.65 
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TABLE. 7 (continued) 







: WT : 


1//A . 




i ftfta ; : 






MAbs 














b£lQ 


i nn 
1*UU 




0 52 


0.64 


0 .43 


rr 

■ ■ D ■ 


1017 


!• UU . 


n 77 


0.77 


0.76 


0 . 11 




1022 


l • uu 


n ft fi 


0.72 


0 , 14 


o. bo 




■ : xU2o 


X • u u 


n Q v : 

w . 7 w . 


0 . 78 


0. 49 


0. 04 




1029 


1 .. 00 


r\ do 
O.oo . 


(Ji OD 




0.16 




1019 


1.00 


0.16 


0.00 


0.41 


0.44 


10 


1027 


1.00 


o.oo 


0 . 02 


0.41 


0.49 




1025 


1.00 


0.75 


0.0 


0.83 


0.72 




1088 


1.00 


0.77 


0.77 


0.53 


0.00 




13H8 


1 . 00 


0.72 


0.72 


0.53 


0.60 



15 As illustrated in Table 7, the study demonstrated 

that there are a series of overlapping neutralizing 
epitopes from been found to be located in the V2 region 
(residues 163 through 200) ., with most of the epitopes 
looat&X between residues 163 and 2 00 . In add it ion, the 

26 study indicates that the critical residues in the V2 

domain for antibody binding are residues 171, 173 , 174, 
177/ 181, 183, 187, and 188. 

EXAMPLE 3 

25; Immunization Studies 

gpl20 from the MN, GNE 8 , and GNE 16 strains of HIV 
was prepared by amplifying the gene from each isolate 
and cloning and expressing the gene in CHO cells as 
described in Berman et al. , J. Virol .66:4464-446? 

30 (1992) . Briefly, the gpl60 gene was amplified with two 
rounds of amplification using the following nested 
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primers according to the protocol by Kellog et al.> PP 
337^347 in PCR Protocols r : a guide to methods and ; 
: .' Innis et al. (eds. J Acad^ic Press; 

Ihc; : / New York. • 
.' : ; 5 • First round primers: 

: AATAATAGCAATAGTTGTGTGGWCC (W is A or T) 
ATTCTTTCCCTTAYAGTAGGCCATCC (Y is T or C) 
Second round primers: 

GGGAATTCGGATCCAGAGCAGAAGACAGTGGCAATGA 

10 GTCAAGAATTCTTATAG C AAAGCC CTTTCCAA 

The primers are SEQ. ID.' NQs. 31-34 . Each gene is then 
digested with the restriction endonucleases Kpnl and 
Accl. The resulting fragment was subcloned into the 
Bluescript (+) phagemid Ml 3 vector (Stratagene, Inc. j 

15 and sequenced by the dideoxynucleotide method (Sanger 
et al. , Proc. Natl . Acad. Sci. USA 74:5463-5467 
■ (1977)) . ' 

A fragment of the gp!20 coding region was then 
used to construct a chimeric gene for expression in 

20 mammalian cells, as described in Lasky et al. , Science 
223 : 2 09-212 (1986) . The 5 •• end was fused to a 
polylinker adjacent to a simian virus 40 (SV40) 
prompter and the 3 ' end was fused to a polylinker 
adjacent to the 3 »• untranslated sequences containing an 

25 SV40 poiyadiehylatiori sigh The expression vector 
(MN-r^pl2 0 ) was co^transf ected in CHO cells deficient 
in production of the enzyme dihydrof olate reductase , • 
alpng with a plasmid (pSVdhfr) containing a cDNA 
encoding the selectable marker, dihydrof olate 

30 reductase. Cell lines expressing MN-rgpl20 were 
isolated as described in Lasky et al.. Science 
223 : 209-212 (1986) . The recombinant glycoprotein was 
purified fro* growth-cbnditibhed cell culture medium by 
immunoaf f inity and ion exchange chromatography as 

35 described in Leonard et al. , J. Biol . Chem. 2 65 ; 10373- 
10382 (1990). 
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qt>12Q from the GNE 8 - and GNE^ strains of HIV is 
^re^ared in the same mariner as described v- : f.br^.-the-;MN. 
isolate.. ; :-:,v.:;. : ,. ■ 

■--'■^ (300 /xg/ injection) , GNE 8 -rgpl20 (300 Mg/ 

5 injects and GNE 16 -rgpl20 (300 Mg/ injection) are 
prepared in an aluminum hydroxide adjuvant (as 
described in Cordonnier et al*, Nature 340:571-574 
(1989)). Six chimpanzees are injected at 0, 4 , and 32 
weeks. Sera are collected and assayed for neutralizing 

10 antibody to each strain of HIV at the time of each 

immunization and three weeks thereafter. At 35 weeks > 
each of the chimpanzees has significant levels of 
neutralizing antibodies to each strain. 

At 35 weeks, the chimpanzees are randomly assigned 

15 to three groups. Each group is challenged with about 
10 50% chimpanzee-infectious doses (CID50) each of one 
of the vaccine isolates . One unimihunizeid; chimpanzee 
(control) is also injected with the same amount of 
virus as the immunized chimpanzees for each vaccine 

20 ; strain. 

Sera are drawn every two weeks throughout the : 
; study and assayed for antibodies to HIV core proteins 
and for the presence of HIV by PGR amplification and 
co-cultivation of peripheral blood mononuclear eel Is 

25 (PBMCs) from the chimpanzee together with activated 

human or chimpanzee 'PBMCs'- The presence of ahtibbdies 
to core proteins indicates the presence of viral 
infection as does the detection of amplif led viral DNA 
or viral infection of co-cultivated cells* 

30 the presence of virus is detected by PGR and 

co-cultivation methods in each unimmunized control 
animal between weeks 2 and 4 post challerigev 
Antibodies to core proteins appear in the control 
chimpanzees at six weeks post challenge. Neither virus 

35 not antibodies are at detectable levels in any of the 
immunized chimpanzees at one year post challenge, 
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indicating that the vaccine effectively protects the 
chimpanzees from infection from each of the challenge 

•strains:;:':'.::. 
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. SEQUENCE LISTING . -"J. 

( 1 ) GENERAL INFORMATION: 

: (i) APPLICANT: Herman > Phillip W. : 
' Nakamura^ Gerald R. - 

(ii^ ^ INVENTION: HIV Envelope Polypeptides 

(iii) NUMBER OF SEQUENCES: 26 

(Iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Skj erven, Morrill, MacPherson, Franklin & 

15 Friel 

(B) STREET: 25 Metro Drive Suite 700 

(C) CITY: San Jose 

(D) STATE: California 

(E) COUNTRY: U.S.A. 
20 (F) ZIP: 95110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Pa tent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
30 (B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Terlizzi, Laura 
35 (B) REGISTRATION NUMBER: 31,307 

(C) REFERENCE/DOCKET NUMBER: M-2820-1P 

( ix) TELECOMMUNICATION INFORMATION: ' 
(A) TELEPHONE: (408) 283-1222 
40 (B) TELEFAX: (408) 283-1233 

(2 ) INFORMATION: FOR SEQ ID NO: 1 : 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 
•(B)- TYPE: amino acid 
(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

79 
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• Met Arg Val Lys Gly Tie Arg Arg Ash Tyr Gin His Trp Trp Gly Arg 

. a--.- .. 5 . 10 ' : ■ 15 

Gly Thr Met Leu Leu Gly Leu Leu Met lie Cys Ser Ala Thr: Glu Lys 

" ' : - Leu Trp Val Thr Vai 1^ Tyr Gly Val Pro Val Trp Lys : Glu Ala Thr 

;" : : ; 'vv : .v : ' ;\ : \-35 •' ■ : '. 40 ' • y 45 ;;/ : >v : : : : ;-:'. • 

: 10 Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Ala 

. ■ so • .. 55 60 



His Ash Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

15 

Gin Glu Val Glu Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 

Asn Asn Met Val Glu Gin Met His Glu Asp lie lie Ser Leu Trp Asp 
20 100 105 110 

Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

25 Asn Cys Thr Asp Leu Arg Asn : Thr Thr Asn Thr Asn Asn Ser Thr Asp 

' 130 ' • 135 . ' 140 

Asn Asn Asn Ser Lys Ser Glu Gly Thr lie Lys Gly Gly Glu Met Lys 
145 150 155 160 



30 ... . 

Asn Cys Ser Phe Asn lie Thr Thr Ser He Gly Asp Lys Met Gin Lys 
165 170 17 5 

Glu Tyr Ala Leu Leu Tyr Lys Leu Asp lie Glu Pro He Asp Asn Asp 
35 180 185 190 

Ser Thr Ser Tyr Arg Leu lie Cys Asn Thr Ser Val He Thr Gin 

195 ''J. - • 205 

40 Ala Cys Pro Lys lie Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala 

210 V; .: 215 220 

Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Lys Phe Ser Gly 
225 230 235 240 

45 

Lys Gly Ser Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly lie 

245 250 . 255.;. 

Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Aia Glu 
50 260 265 270 

Glu Glu Val Val lie Arg Ser Glu Asp Phe Thr Asp Asn Ala Lys Thr 



80 
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275 •. 280 - ■ 285 

lie Tie Val His Leu Lys Glu Ser Val Gin lie Ash Cys Thr Arg Pro 

• 5 : : v;.V;v. : :. : ... ' . ' . . "• \ ' ... : . : . y ■ . . ' . . • : . ■ ' .' . '•, •' ■ :' ' V '■ ■ . . . 

Ash Tyr Asn Lys Arg Lys Arg lie His lie Gly Pro Gly Arg Ala Phe 

■-■■ : : ^:^--:^:^-^-:r ■ ;/ 310 : • 315, : 320 

Tyr Thr Thr Ly^ Ash lie Lys Gly Thr lie Arg Glh Ala His Cys lie 
10 325 , • 330 335 

lie Ser Arg Ala Lys Trp Asn Asp Thr Leu Arg Gin lie Val Ser Lys 
340 345 350 

15 Leu Lys Glu Gin Phe Lys Asn Lys Thr lie Val Phe Asn Pro Ser Ser 
355 360 365 

Gly : Gly Asp Pro Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu 

370 375 380 

20 

Phe Phe tyr Cys Asn Thr Ser Pro Leu Phe Ash Ser lie Trp Asn Gly 

385 390 395 ;: 400 

Asn Ash Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn lie Thr Leu 
25 405 410 415 

Gin Cys Lys lie Lys Gin lie Tie Asn Met Trp Glh Lys Val Gly Lys 
420 425 430 

30 Ala Met tyr Ala Pro Pro Tie Glu Gly Gin lie Arg Cys Ser Ser Asn 
435. ' 440 445 

lie thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Asp thr Asp Thr 
450 455 460 
35 . ' "v---;-/ 

Ash Asp Thr Glu lie Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn 



465 : 470 475 4 80 

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Thr lie Glu Pro Leu 

40 ; : 485'. ' ' 490 495 

Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu 

. •J./',;.'." 500, 505 ' 510 



(S£ INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 501 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Lys Tyr Ala Leu Ala Asp Ala Ser Leu Lys Met Ala Asp Pro Asn Arg 

' ;\.;.;5: : ;:;;.;:''-:.'-' : 10 .'-IS. : ■ 

5 Ph Arg Gly Lys Asp Leu: Pro Val Leu Asp Gin Leu Leu Glu Val Pro 

■ ••. 20 :/ •'v•.v•.v•:•v; : • : v-; : ::; : ^V: . ; ; .' 25 ; . : ' • . . • 3.0 

• Val Trp Lys Glu Ala Thr Thr Leu Phe Cys Al£ Ser Asp Ala Lys 

10 ■' 35 ■ -40 • ■ 45 . . 

Ala Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val 
50 55 60 

15 Pro Thr Asp Pro Asn Pro Glh Glu Val Glu Leu Val Asn Val Thr Glu 

65 7 9 y 75 80 

Asn , Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
85 90 95 

20 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
100 105 110 

Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn 
25 115 120 125 



Thr Ash Asn Ser Thr Asp Ash Asn Asn Ser Lys Ser Glu Gly Thr lie 
130 135 140 

30 Lys Gly Gly Glu Met Lys Asn Cys Ser Phe Asn lie Thr Thr Ser lie 
145 150 : 155 160 

Gly Asp Lys Met Gin Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp lie 
165 170 175 

3 5 ■ 

Glu Pro lie Asp Asn Asp Ser Thr Ser Tyr Arg Leu lie Ser Cys Asn 
180 185 190 



Thr Ser Val lie Thr Gin Ala; Cys Pro Lys lie Ser Phe Glu Pro lie 

40 ■ ' 195 200 205 



Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn 
210 : 215 220 

4 5 Asp Lys Lys Phe Ser Gly Lys Gly Ser Cys Lys Asn Val Ser Thr Val 
225 230 235 240 

Gin Cys Thr His Gly lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu 
245 250 255 

50 ; • ' \\- : . . ■ . ' • 

Asn Gly Ser Leu Ala Glu Glu Glu Val Val lie Arg Ser Glu Asp Phe 
260 265 270 

' 82 
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: Thr Asp Asn Ala Lys Thr lie lie Val His Leu Lys Glu Ser Val Gin 

: 275 ' ■ ' • ■ . ■ 280 . . -285 . : 

\:.^ite: : Asii -Cys- Thr Arg Pro Ash Tyr Asn Lys Arg Lys Arg lie His lie 

• •5 : . : .-.. v. : : : '. : . : : : ..2 : 9.0,- . . : 295 : \t.v ; ' #3- K" ■ ? 00 

Gly Pro Gly Arg Ala Phe Tyr Thr Thr Lys Asn life Lys Gly Thr lie 

•../■■.r 305v; ; f' -v- 310 . • ; 315:- : ;-v/; : .; _ : . .. 320 

10 Arg Gin Ala His Cys lie lie Ser Arg Ala Lys Trp Ash Asp Thr Leu 

; 325- • • 330 - ; 335.; 

Arg Gin lie Val Ser Lys Leu Lys Glu Gin Phe Lys Asn Lys Thr lie 
340 345 350 

■ 15 

Val Phe Asn Pro Ser Ser Gly Gly Asp Pro Glu Tie Val Met His Ser 
355 360 365 

Phe Ash Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe 
20 370 375 38Q : / 

Asn Ser He Trp Asn Gly Ash Asn Thr Trp Asn Asn Thr Thr Gly Ser 
385 390 395 400 

25 Asn Asn Ash lie Thr Leu Gin Cys Lys lie Lys Gin lie He Asn Met 

405 410 415 

Trp Gin Lys Val Gly Lys Ala Met Tyr Ala Pro Pro He Glu Gly Gin 
420 425 430 

■' • 30 : 

lie Arg Cys Ser Ser Asn lie Thr Gly Leu Leu Leu Thr Arg Asp Gly 
435 440 . 445 

Gly Glu Asp Thr Asp Thr Asn Asp Thir Glu lie Phe Arg Pro Gly Gly 

35 ' 450"" ' ; : '- 455 V ■ : • '. 460'-'; ' 

■ Giy Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val 

4 65 . ' 470 475 ' •• ' '. : ■ '480 . 

40 Val Thr lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg 

485 -. : 490 • . ' 495 ■ 

Val Val Gin Arg Glu 
500 

45, ' - - • ■ • . ' - '•. . . 

(2) INFORMATION FOR SEQ ID NO: 3 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 amino acids 
50 (B) TYPE : amino acid 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3 : 

Cvs Lys lie Lys Gin il£: : II ie Ash Met Trp Gin Lys Val Gly Lys Ala 

■ 5 ; 1 ' * ' : ' 5 ; . ■ • 10 . - . , ' . * 5 

Met Tyr Ala Pro Pro lie Glxi ■ Gly Gin lie Arg Cys 

' ■ - ' : - ::V v 20-.- -\ y: : '\^-y' : ''r r ' : • 

(2p INFORMATION FOR SEQ ID NO:4 - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid / 
15 (D) TOPOLOGY: linear ; 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

20 

Cys Arg lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 

1 5 ■ 10 ' 15 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
25 20 . 25 

( 2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH : 28 amino acids 

(B) TYPE: amino acid 
•(D) TOPOLOGY : linear 



35 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

Cys Arg lie Lys Gin He lie Asn Met Trp Gin Glu Val Gly Lys Ala 

1 ' ; 5 . 10 . 15 . 

40 ; • , : • ■■ V. : .f- V: : . . : : :i - ,'. 

Met Tyr Ala Pro Pro He Lys Gly Gin He Arg Cys 

20 ■ yyy- 25 

(2) INFORMATION FOR SEQ ID NO: 6 : 
45 • ■ •' . 'v.; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 



W094/28929 PCT/US94/06036 
( xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 6 : 

Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Gly Val Gly Lys Ala 

Met Tyr Ala Pro Pro lie Gl^ lie Asn Cys 

" ' ■ : ■ " : 20'f "■' : ^ : '-^r: : ^ ' ' \' '. 25 ■ : V. ■ ■ ".■ "A ■ . . . 

(2) INFORMATION FOR SEC} it) tib:7: 

. (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 : 

20 Cys Arg lie Lys Gin lie lie Asn Arg Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

lie Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

25 ' 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



35 (xi) SEQUENCE DESCRIPTION : SEQ ID NO:8: 

Cys Arg lie Lys Gin lie Val Asn Met Trp Gin Arg Val Gly Gin Ala 

• 1 . 5 1:0 ' ■■" 15 

4 0 Met Tyr Ala Pro Pro lie Lys Gly Val lie Lys Cys 

20 • 25. ... 

(2) INFORMATION FOR SEQ ID NO : 9: 

45 (i) SEQUENCE CHARACtMiSTICS 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: liriear 

so '. . ■ . ;• . • " ^ ..." . . •• ; ■ 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 9 : 
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Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Gly Ala Gly Gin Ala 

- • 5 . ' . 10 • 15 ■ 

. ; : Met Tyr Ala Pro Pro lie Ser Gly Thr lie Asn Cys r 

•:5;.:...V : ; : : ; ::;:': : : ;: : -:"-V • . .. '2-0 . 25 ' ^ ^ ^ ^[ ^ ^ 

(2) INFOP^TION FOR SEQ ID NO: 10 : 

( i) : SEQUENCE CHAI^CI^iStlCS : 
10 ; (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
.•••.'(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Arg Tie Lys Gin Phe lie Asn Met Trp Gin Glii Val Gly Lys Ala 
i 5 10 15 

20 " 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 11: 

25 . : 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 amino acids 
■(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 ■ 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

35 Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 

' 1 \ 5 - ■ .. 10 15. 

Met Tyr Ala Pro Pjto lie Ser Gly Gin lie Arg Cys : 
20 25 

40 . 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Arg lie Lys Gin lie lie Asn Met Trp Glri Glu Val Gly Lys Ala 
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: i :,-5:' 10 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 

(2) INFORMATION FOR SEQ ID NO: 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
10 (B) TYPE: amino acid V 

■(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13 : 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

20 Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 

20 ; "', ; 25 

(2) INFORMATION FOR SEQ ID NO: 14: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Gly Gly Asp Pro Glu lie Val Met His Ser Phe Asn Cys Gly Gly 
35 1 5 ; 10 15 

Glu Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe Asn Ser lie Trp Asn 

• 20 • . 25 * ■ 30 

40 Gly Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn lie Thr 

. 35 ■ 40'. - 45 •• ■ 

Leu Gin Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly 

50 ' 55 . 60 



45 • 

Lys Ala Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys Ser Ser 

65 70 75 ' '. : 80 



Asn lie Thr Gly Leu Leu Leu Thr Arg; Asp Gly Gly 
50 85 90 



(2) INFORMATION FOR SEQ ID NO: 15: 

87 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

10 cys Lys lie Lys Glh lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 

.'. i . 5 ■ 10 ■ ■ 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 

20 . - 25 

15 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



25 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 

Cys Lys He Lys Gin lie lie Asn Met Trp Gin Ala Val Gly Lys Ala 

1 .-:5: " 10 ' " ' 15 ' 

3 0 Met Tyr Ala Pro Pro lie Glu Gly Gin He Arg Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 17 : 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 • '. - ■ '/ - - : ' : . . , / : ; ; ;: : y; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Cys Ala He Lys Gin lie He Asn Met Trp Gin Lys Val Gly Lys Ala 

45 1 . : 5 10 : 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin He Arg Cys 

v 20 25 

(5P INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

88 
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(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid - ■ 

(D)- TOPOLOGY: linear-; 

(xi) SEQ^ q : y^ : r^ 

Cys Lys lie Ala Gin I le lie Asn Met Trp Gin Lys Val Gly Lys Ala 

10 vlv-v : ::/;:;. : V: : v v.'.'., - : 5 . ■ ; ' , 10 : ;\ 15 ; . . 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 

- : 20 25 

(2)5 INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 . '■. . • v..-,: 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Vai Gly Ala Ala 

"■■ . 1 5 /. .' 10 = -: : 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 

30 . . 20 ... . - 25 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 
35 :(A) LENGTH: 28 amino acids 

•. (B) TYPE: amino acid 
(D) TOPOLOGY: linear : 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 

. ' 1 • ■ ' 5 " ■ '■ ' 10 - " ; :' 15 

45 

Met Tyr Ala Pro Pro lie Ala Gly Gin lie Arg Cys 

■'=■■■. .20 25 ■ 



(2) INFORMATION FOR SEQ ID NO: 21: 
50 • . . . 

(i) SEQUENCE CHARACTERISTICS : 

(A) : LENGTH: 28 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21 : 

Cys Arg lie Lys Gin Phe lie Asn Met Trp Glri Glu Val Giy Lys Ala 

1 5 • ••;avvv.v.- : 10. ;: 15 

id : : .. '. . •': ; : :y : \r • ..^;y : ^: : Y;; ,;y ; - -v- ; • 

Met Tyr Ala Pro Pro lie Ser Gly Gin He Arg Cys 
20 • 25 . 

(2) INFORMATION FOR SEQ ID NO: 22: 

15 .. 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

25 Cys Lys lie Lys Glri Phe lie Asn Met Trp Gin Lys Val <Gly Lys Ala 

• . 1 ' 5 10 ■ . . 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin He Arg Cys 
20 25 

30 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 28 amino acids 
3 5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Cys Lys lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 

5 1.0 ),.. r , . . 15 

45 Met Tyr Ala Pro Pro He Glu Gly Glri He Arg Cys 

20 . ■,25->;":;-V" : " : ' : 

(2) INFORMATION FOR SEQ ID NO: 24: 

50 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 



5 ( xi) SEQUENCE DES CRIPTION : -SEQ ID: NO: 24: 

Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
10 lie Ser : 



(2) INFORMATION FOR SEQ ID NO: 25: 

15 (i) SEQUENCE CHARACTERISTICS : . . 

(A) LENGTH : 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro 
25 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 26 : ^ 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
: (D) TOPOLOGY: linear 

■ 35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gly Lys Ala Met Tyr Ala Pro Pro lie Lys Gly Gin lie Arg 

: 1 5 V":10 . 

40 • 
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,HHAT:.IS CLAIMED IS:; 

A method for making an HIV gp!20 ^iibunit vaccine 
i . for a geographic region comprising the steps of : 
5 ; : a. determining a neutralizing epitope in the V2 
or C4 domain of gpl20 of HIV isolates from 
the geographic region; 

b. selecting an HIV strain having gp!20 which 
has a neutralizing epitope in the V2 or C4 

10 domain which is common among isolates in the 

geographic region; and 

c. making an HIV gpl20 subunit vaccine from the 
selected isolate. 

2v The method of Claim 1 wherein the: neutralizing 
15 epitope is determined by determining the amino 

: : acid sequence for at least a portion of the V2 or 

C4 domain. 

3. The method of Claim 2 wherein the amino acid 

sequence is determined by sequencing DNA encoding 
20 at least a portion of the V2 or C4 domain. 

4i The method of Claim 3 wherein a plurality of 

isolates having different amino acid sequences for 
the V2 and 04 domains are selected. 

5 . This method of Claim 4 wherein a plurality of 

25 isolates having different amino acid sequences for 

the V3 domain is selected. 

6. A method for making an HIV gpl20 subunit vaccine 
for a geographic region comprising the steps of: 

a. determining neutralizing epitopes for the V2, 
30 V3 f and C4 domains of gpl20 from HIV isolates 

;: from the geographic region; 

b. selecting at least two HIV isolates having 
different neutralizing epitopes in the V2, 

: 'v: ' V3^ or C4 domain; and 
35 : c. making an HIV gpl20 subunit vaccine; from the 

■■'y ■■:'■[.:" : : selected isolates. 
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7 . The method of Claim 6 wherein each of the selected : ;. 
. isolates have one of the most common neutralizing 

' : :'.-'.,epitOpeS^ :; :-- • 

8. ; A method; for making an HIV gpl20 subunit vaccine 
5 : : for a geographic region comprising the steps of: 

a. determining the neutralizing epitopes for HIV 
isolates and the percentage of HIV infections 
attributable to each strain present in the 
region; 

10 b. selecting at least two HIV strains which have 

the most common neutralizing epitopes in the 
V2, V3V and C4 domains in the geographic 
region; and: 

c. making an HIV gpl20 subunit vaccine from the 
15 selected isolates, 

9. The method of Claim 8 wherein the isolates are 
primary patient isolates. 

10. The method of Claim 8 wherein the geographic 
region is North America and the amino acid 

20 sequence of gpl20 from the HIV isolates MN and GNE g 

are selected. 

11. The method of Claim 10 wherein the GNE 16 isolate is 
also selected . 

12. A mu 1 1 i va 1 en t HIV gpl20 subunit vaccine. 

25 13 . The vaccine of Claim 12 wherein gpl20 present in 
the vaccine is from at least two HIV isolates 
which have a different neutralizing epitope in the 
V2 or C4 domain of gpl20. 

14. The vaccine of Claim 12 wherein gp!20 present in 
30 the vaccine is from at least two HIV isolates 

which have a different neutralizing epitope in the 
V3 domain of gpl20. 

15. The vaccine of Claim 12 wherein each isolate has a 
different common neutralizing epitope for the V2 

35 or C4 domains of gpl20. 
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16. The vaccine of Claim ^1 present in 
the vaccine is from the; MN:and GNE 8 strains of HIV. 

17. The vaccine of Claim l wherein gp!20 from the GNE J6 
strain of HIV is also present in the vaccine* 

5 18. A DNA sequence of less than 5 kilobases encoding 
gpl20 from GNE 8 and having the nucleotide sequence 
illustrated in Table i. ; 

19. A DNA sequence of less than 5 kilobases encoding 
gpizo from GNE 16 and having the nucleotide sequence 

10 illustrated in Table 2. 

20. An expression construct comprising DNA encoding 
gp!20 selected from the group consisting of 
GNE 8 -gpl20 and GNE 16 -gpl20 under the transcriptional 
and translation*! control of a heterologous 

15 promoter. 

21. The expression construct of Claim 20 wherein the 
promoter is a eukary otic promoter. 

22. The expression construct of Claim 21 wherein the 
DNA encoding gpl20 is joined to a heterologous 

20 signal sequence. 

23. An isolated GNE 8 -gpl20 polypeptide having the amino 
acid sequence illustrated in Table 1. 

24. An isolated GNE^-gplZp polypeptide having the 
amino acid sequence illustrated in Table 2. 

25 25. An improved serotyping method for HIV strains 
comprising determining the serotypes of the V2, 
V3, and C4 domains of gpl20. 
26. A truncated gpl20 sequence which sequence is free 
from the C5 domain. 
30 27. The truncated gpl20 sequence of Claim 26 wherein 

the sequence is additionally free from the car boxy 
: terminus through amino acid residue 453 of the 
gpl20 V5 domain. 
28. The truncated gpl20 sequence of Claim 27 wherein 
35 the sequence is additionally free from the gpl20 

V5 domain. 
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29 . The truncated gpi2 0 sequence of Claim 26 wherein 
the sequence is addi t i oha 1 ly free from the gpl20 
: . ; s i gna 1 sequence . ■ • ■ 

30 . The truncated gp 12 0 sequence of Claim 29 wherein 
5 : : the silence ^ from the car boxy 

terminus thrpugh^ residue ill of the 

gp!20 CI doiain. 
31 # The truncated gp!20 sequence of Claim 29 wherein 

the sequence^ is additionally free from the carboxy 
10 terminus through amino acid residue 117 of the 

gpl20 CI domain. 

32. The truncated gpl2 0 sequence of Claim 26 wherein 
the sequence is free from the amino terminus of 
gp!20 through residue 111 of the CI domain and 

15 residue 453 through the carboxy terminus of gpl20. 

33. The truncated gp!2 0 sequence of Claim 26 wherein 
the sequence is produced by recombinant 
engineering . : : 
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